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AN  INTRODUCTION  TO  STATISTICAL  DESIGN  OF  EXPERIMENTS 
IN  METALLURGICAL  RESEARCH1 


S.  J.  Hussey,2  P.  L.  Placek,3  and  C.  H.  Schack4 


SUMMARY 

NThis  Bureau  of  Mines  report  presents  a discussion  of  the  statistical 
approach  to  the  design  of  experiments  and  indicates,  by  concrete  examples,  how 
and  when  statistical  design  is  essential  for  obtaining  meaningful  data  with  a 
minimum  number  of  experiments. 

Every  research  worker  plans  his  work  and  designs  his  experiments  in  a 
manner  which,  based  on  his  experience,  seems  most  rational  to  him  and  which  he 
believes  will  produce  the  information  essential  for  resolving  the  problems. 

If  the  results  obtained  are  open  to  multiple  interpretations  and  are  not  of 
definitive  reliability,  the  failure  of  the  experiment  design  to  produce  the 
desired  information  may  be  due  to  several  causes.  These  include  faulty  judg- 
ment in  selecting  the  factors  to  be  tested,  ignoring  the  importance  of  inter- 
action between  factors,  failure  to  adequately  control  test  variables,  and 
assuming  that  meaningful  trends  in  the  results  can  be  identified  without  com- 
parison to  a predetermined  experimental  error.  An  experienced  researcher 
probably  will  deduce  from  the  nature  of  the  data  obtained  that  the  design  is 
faulty  and  may  by  trial  and  error  ultimately  arrive  at  the  correct  solution  to 
the  problem.  Statistical  methods  of  planning  research  minimize  the  chance  of 
faulty  experiment  design,  embody  concrete  procedures  for  testing  the  validity 
and  reliability  of  the  test  data  and  for  indicating  mandatory  changes  in  the 
design,  and  enable  the  researcher  to  obtain  meaningful  information  with  the 
least  number  of  tests. 

The  classical  or  "Edisonian"  method  of  designing  a research  program  is 
the  essence  of  simplicity.  Each  factor  deemed  to  have  a significant  influence 
on  the  results  is  separately  investigated.  The  effect  of  each  factor  is  esti- 
mated by  a series  of  tests  which  vary  one  factor  over  a selected  range  of 
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values  while  holding  all  other  factors  constant.  Such  a design  of  the  experi- 
mental work  will  be  adequate  for  the  investigation  of  problems  involving  sim- 
ple relationships  or  for  research  directed  to  achieve  goals  readily  identified 
by  data  that  may  lack  specified  precision  and  reliability.  When,  as  is  common, 
the  research  involves  complex  interrelationships  between  many  diverse  factors, 
definitive  trends  and  conclusions  can  be  made  only  from  data  that  can  be  ana- 
lyzed and  tested  as  to  validity  and  significance.  An  experimental  design 
based  on  the  classical  method,  however,  will  not  detect  or  yield  estimates  of 
significant  interaction  effects  between  the  important  factors,  nor  can  the 
data  be  objectively  analyzed  to  provide  information  as  to  reliability. 

The  purpose  of  statistical  design  is  to  plan  experiments  so  that  the 
results  will  yield  meaningful  information  about  fundamental  or  empirical 
relationships.  Statistical  methods  embody  well  codified  designs  for  various 
types  of  experimental  conditions  and  also  provide  ways  to  determine  the  merits 
of  uuch  designs  for  providing  information  in  research.  Overdesign  and  under- 
desigu  are  inevitable  unless  suitable  criteria  are  used  to  judge  the  correct 
number  and  type  of  observations  to  make. 

The  designs  discussed  in  this  report  are  for  (1)  estimating  the  effect  of 
one  or  more  independent  variables  upon  a dependent  variable  and  (2)  defining 
an  equation  by  which  the  dependent  variable  can  be  estimated  from  values  of 
the  independent  variables.  In  the  statistical  approach  to  research,  the  pos- 
tulation that  the  dependent  and  the  independent  variables  determine  a hypo- 
thetical and  mathematical  space  provides  the  theoretical  setting  for  the 
development  of  mathematical  expressions  for  empirical  and  fundamental  relation- 
ships. The  use  of  statistical  principles  within  such  settings  facilitates  the 
development  of  the  desired  equations  that  represent  the  relationships  being 
investigated. 

One  of  the  more  important  statistical  designs  is  the  factorial  design  by 
means  of  which  several  factors  and  their  interactions  are  studied.  Two- level 
factorial  designs  are  particularly  applicable  in  preliminary  research  and  in 
the  determination  of  optinum  operating  conditions.  Factorial  designs  in  which 
more  than  two  levels  are  selected  for  some  or  all  of  the  factors  are  used  when 
a more  complete  investigation  of  the  relationships  between  factors  and 
response  is  desired.  When  main  effects  are  not  linear  over  the  region  of 
interest  and  interaction  effects  are  relatively  large,  more  than  two  levels 
for  the  factors  are  necessary  for  a complete  study  of  the  response. 

A fractional  factorial  design  may  be  used  when  some  of  the  interactions 
are  not  important.  In  such  a design  the  estimates  of  the  various  effects  are 
combined  so  that  only  the  important  effects  are  studied  in  an  experiment. 

When  the  estimates  of  effects  are  thus  combined,  they  are  said  to  be  con- 
founded. The  tests  which  are  sufficient  for  estimating  the  important  effects 
are  selected  from  the  corresponding  full  factorial  design.  Fractional  facto- 
rial designs  and  the  principle  of  confounding  also  are  useful  in  metallurgical 
research  when  it  is  necessary  to  introduce  minor  factors  into  an  experiment  by 
making  tests  in  two  or  more  sets  which  are  called  blocks.  Tests  are  assigned 
to  the  different  blocks  in  such  a way  that  the  effects  of  these  minor  factors 
will  be  confounded  with  unimportant  interaction  effects.  When  all  the 
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interactions  between  factors  are  important,  several  experiments  should  be  made 
in  which  the  effect  of  a difference  between  two  blocks  is  confounded  with  a 
different  interaction  effect  in  each  experiment.  Thus,  an  estimate  of  every 
interaction  effect  is  obtained  from  at  least  one  experiment.  This  procedure 
is  called  partial  confounding. 


The  selection  of  a sufficient  number  of  factor  levels  for  investigation 
v in  an  experiment  is  important.  When  insufficient  levels  are  selected,  impor- 

tant polynomial  components  that  are  not  taken  into  consideration  are  con- 
founded with  those  polynomial  components  for  which  the  design  was  prepared. 

The  estimates  of  the  latter  components  then  are  biased.  In  such  a situation 
an  economy  sometimes  can  be  effected  in  the  amount  of  experimental  work  by 
making  a few  additional  tests  properly  related  to  the  tests  of  the  basic  fac- 
torial or  fractional  factorial  design  without  making  a full  factorial  design 
of  higher  order.  Such  designs  are  termed  composite  designs  and  provide  the 
advantage  of  proceeding  with  experimental  work  by  natural  stages.  The  com- 
posite design  is  especially  helpful  in  the  determination  of  optimum  conditions 
by  taking  the  shortest  route  as  determined  by  a succession  of  two- level  facto- 
rial experiments. 

Nested  designs  generally  are  used  when  successive  sampling  is  necessary 
to  determine  the  variation  in  some  characteristic  that  contribures  to  the 
experimental  error.  By  means  of  nested  designs  the  total  experimental  error 
may  be  divided  into  components  corresponding  to  the  errors  contributed  by  sev- 
eral sources.  Multiple  replication  at  any  level  in  a design  is  made  to 
increase  the  precision  of  estimating  effects  and  should  not  be  used  solely  to 
obtain  a better  estimate  of  experimental  error. 

When  an  experiment  is  designed  by  statistical  methods,  the  experimental 
errors  that  are  associated  with  each  observation  are  assumed  to  be  random 
variables  that  are  uncorrelated  between  replicates  and  between  factor  levels. 
Correlation  due  to  factors  that  cannot  be  controlled  may  be  minimized  by  ran- 
domization in  the  execution  of  the  experiment.  Correlation  between  errors  may 
cause  trends  in  the  results  that  contribute  to  the  estimates  of  important 
effects. 

Precise  formulation  of  experiment  objectives  and  adequate  design  of 
experiments  followed  by  capable  analysis  of  data  are  characteristics  of  well 
executed  research.  One  of  the  most  valuable  benefits  resulting  from  the  use 
of  statistical  methods  is  an  enforced  discipline  of  thought  leading  to  a 
precise  formulation  of  research  problems  and  experimental  objectives.  This 
is  the  foundation  upon  which  the  statistical  design  of  experiments  is  based. 

A later  report  is  planned  to  consider  the  methods  used  in  analyzing  data  from 
statistically  designed  experiments. 

INTRODUCTION 

The  studies  herein  reported  were  undertaken  as  a part  of  a broader  Bureau 
of  Mines  program  on  the  application  of  statistical  methods  to  planning  and 
evaluating  research  in  extractive  metallurgy.  Research  has  become  Increasingly 
dependent  upon  the  services  of  skilled  and  semiskilled  technicians  guided  by  a 
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limited  professional  staff.  Therefore,  it  has  become  imperative  to  utilize 
every  possible  means  of  increasing  the  efficiency  and  effectiveness  of  plan- 
ning, directing,  conducting,  and  evaluating  the  research  program.  This  report 
is  intended  primarily  for  the  benefit  of  research  workers  with  professional 
training  who  must  first  design  and  direct  experiment  work  and  then  evaluate 
the  resulting  data  in  terms  of  comprehensive  and  meaningful  results. 

Statistical  theory  is  a branch  of  applied  mathematics  that  has  its  roots  * 

in  the  area  of  pure  mathematics  known  as  the  theory  of  probability.  In  apply- 
ing statistical  methods,  the  research  worker  uses  mathematical  tools  that  have 
been  devised  to  facilitate  the  analysis  of  data  and  the  design  and  analysis  of  t 

various  types  of  experiments.  Most  books  written  on  statistical  methods  and 
their  practical  application  are  couched  in  formidable  mathematical  terminology 
or  are  so  abstruse,  due  to  the  manner  of  presenting  the  illustrative  material, 
that  a logical  association  of  the  more  important  and  useful  principles  can  be 
obtained  only  after  extensive  study.  An  attempt  has  been  made  in  this  report 
to  avoid  discouraging  the  researcher  who  has  only  an  ordinary  background  in 
college  algebra  and  an  elementary  background  in  calculus. 

The  salient  useful  features  of  statistical  methods  are  the  controls  which 
can  be  exercised  over  both  experiments  and  deductions  with  a specified  degree 
of  probability.  A lack  of  sufficient  control  can  be  determined  and  the  neces- 
sary corrective  or  precautionary  measures  indicated  by  statistical  methods. 

These  useful  features  of  statistical  methods  do  not  obviate  the  necessity  for 
the  research  worker  to  exercise  proper  judgment  in  selection  of  the  statisti- 
cal method  best  adapted  to  the  particular  problem  and  in  the  interpretation  of 
the  results  obtained  by  statistical  analysis.  The  decisions  and  conclusions 
based  upon  statistical  analysis  must  be  properly  integrated  with  the  technical 
knowledge  peculiar  to  each  investigation. 

In  presenting  a coherent  discussion  of  so  complex  a subject  as  the  appli- 
cation of  statistical  methods  to  the  design  of  metallurgical  research,  consid- 
erable simplification  has  been  necessary.  Only  those  elements  of  probability 
theory  sufficient  to  enable  the  reader  to  understand  the  principles  involved 
in  statistical  design  are  presented.  Words  that  have  a unique  meaning  in 
statistical  mathematics  are  defined.  Special  attention  is  given  to  presenting 
the  symbolic  language  of  statistics  in  a consistent  and  clear  manner  readily 
coordinated  with  that  used  in  important  references  cited  in  this  report. 

Finally,  the  report  attempts  to  illustrate  by  means  of  concrete  examples  of  » 

metallurgical  problems  when  and  how  different  statistical  methods  can  be  used 
to  design  the  simplest  and  most  efficient  experiment  program  that  will  produce 
meaningful  data  readily  analyzed  to  derive  conclusions  with  a specified  degree 
of  probability. 


DESIGN  OF  EXPERIMENTS 

A good  experimental  design  is  one  of  three  prerequisites  to  efficient 
research  that  will  enable  the  researcher  to  obtain  the  information  desired 
with  increased  effectiveness  and  with  a minimum  expenditure  of  time.  The 
three  prerequisites  are  (1)  a correct  formulation  of  the  required  information, 
(2)  a correct  choice  of  experimental  method,  after  consideration  of  required 


accuracy  and  experimental  pitfalls,  and  (3)  a correct  choice  of  the  number  and 
interrelation  of  the  individual  tests  which  characterize  the  design  of  the 
whole  experiment.  These  prerequisites  should  be  considered  in  the  sequence 
listed,  since  the  first  two  steps  will  affect  the  usefulness  of  the  design. 

Statistical  design  basically  deals  with  the  third  prerequisite  for 
obtaining  maximum  results  from  a minimum  of  experimental  work.  Statistical 
methods  have  provided  (1)  well  codified  designs  for  various  types  of  experi- 
mental conditions,  (2)  means  for  determining  the  relative  merits  of  such 
designs,  and  (3)  under  certain  conditions,  estimates  concerning  the  necessary 
number  of  observations  to  be  made  in  an  experiment.  An  overdesigned  experi- 
ment with  too  many  observations  results  in  useless  labor.  An  underdesigned 
experiment  with  too  few  observations  may  lead  to  false  conclusions.  Over- 
design  and  underdesign  are  inevitable  unless  a suitable  criterion  is  used  to 
judge  the  correct  number  of  observations  to  make.  The  economies  of  correct 
design  and  a proper  number  of  observations  outweigh  the  extra  time  and  thought 
required  in  planning  the  experiment. 

In  statistical  terminology,  dependent  variables  are  called  responses 
because  they  are  results  which  respond  in  some  manner  to  changes  in  the  inde- 
pendent variables.  In  any  one  experiment  there  is  usually  either  one  such 
response,  or  else  the  results  of  the  experiment  can  be  evaluated  independently 
with  respect  to  each  response.  Hie  independent  variables  which  affect  the 
response  are  called  factors.  Usually  many  factors  influence  the  response 
under  investigation,  and  it  is  essential  that  all  of  them  be  given  due  consid- 
eration. Each  factor  either  should  be  investigated  for  its  effect  upon  the 
response  or  else  should  be  controlled  so  that  the  influence  of  those  factors 
that  are  investigated  can  be  independently  studied.  Due  to  practical  limita- 
tions those  factors  that  are  considered  to  have  the  more  important  effects 
upon  the  response  usually  are  selected  for  study.  Factors  such  as  time,  tem- 
perature, and  concentration  are  called  quantitative  factors  because  they  can 
be  classified  according  to  some  scale.  Factors  such  as  operators,  machines, 
shifts,  type  of  reagent,  and  type  of  ore  that  cannot  be  defined  by  a scale  are 
named  qualitative  factors. 

Classifications  of  a factor  in  an  experiment  are  called  levels.  A partic- 
ular value  of  the  response  obtained  in  an  experiment  is  called  an  observation. 
In  statistical  terminology  the  combination  of  all  the  factor  levels  in  an 
experiment  that  produces  a particular  observation  is  called  a treatment  but  is 
herein  referred  to  as  a test.  An  experiment  consists  of  all  the  tests  that 
are  included  in  the  design  in  order  to  obtain  the  desired  information. 

For  the  purposes  of  estimating  the  experimental  error  and  of  obtaining 
experimental  precision,  it  is  often  necessary  to  repeat  each  test  two  or  more 
times.  An  experiment  in  which  every  test  is  performed  once  is  called  a single 
replicate.  If  every  test  is  performed  n times,  then  the  experiment  is  said  to 
involve  n replicates.  The  process  of  repeating  an  experiment  or  a particular 
test  is  called  replication. 

The  terms  response,  factor,  quantitative,  qualitative,  level,  test, 
observation,  experiment,  and  replicate  are  a convenient  means  of  reference  for 
the  detailed  discussion  of  experimental  designs  and  the  analysis  of  results. 


The  best  design  for  an  experiment  requires  prior  knowledge  of  the  errors 
involved  and  the  size  of  the  effect  it  is  necessary  to  detect.  Even  sketchy 
information  is  better  than  none.  When  statistical  design  is  the  operating 
method  of  a research  center,  a store  of  such  knowledge  will  rapidly  accumulate 
and  provide  a fund  of  knowledge  for  later  design  work.  When  such  knowledge  is 
sketchy  at  the  start  of  a research  program,  further  information  usually  is 
acquired  during  the  course  of  the  work  if  proper  design  has  been  made.  These 
interim  results  can  be  used  to  provide  a more  substantial  basis  for  decisions 
about  subsequent  design  and  number  of  observations.  The  advantages  of  this 
sequential  approach  are  quite  marked  when  it  is  used  in  a systematic  manner. 

The  designs  that  are  discussed  are  for  the  purpose  of  estimating  the 
effect  of  one  or  more  independent  variables  upon  a dependent  variable  and  in 
many  cases  to  ultimately  determine  an  estimated  equation  by  means  of  which  the 
dependent  variable  can  be  estimated  from  values  of  the  independent  variables. 
Such  designs  are  based  upon  the  assumption  that  observing  an  apparent  effect 
at  specific  values  of  the  independent  variables,  with  due  regard  to  experimen- 
tal error,  is  sufficient  grounds  to  conclude  that  the  dependent  variable  will 
be  similarly  affected  for  all  values  in  the  range  of  the  independent  variables 
involved.  There  is  another  class  of  problems  in  which  the  main  interest  is  in 
determining  the  existence  of  a relationship  between  two  or  more  variables  over 
a specified  range  with  a specified  degree  of  statistical  certainty.  The  sta- 
tistical procedure  for  determining  these  relationships  is  known  as  correlation. 

Mathematical  Representation  of  Natural  Relationships 

The  purpose  of  statistical  design  is  to  provide  plans  for  experiments  such 
that  when  the  experiments  are  properly  executed  the  results  will  yield  maxi- 
mum information  about  fundamental  relationships  or  about  empirical  relation- 
ships that  approximate  fundamental  relationships  in  a restricted  region  of  the 
factors.  These  relationships  often  are  so  complex  that  a study  of  the  under- 
lying causes  of  all  observed  effects  would  involve  a prohibitive  amount  of 
work.  In  such  cases  statistical  methods  usually  can  be  employed  to  develop 
suitable  mathematical  expressions  that  represent  the  natural  relationships 
by  means  of  approximating  empirical  expressions.  Such  expressions  provide  a 
convenient  means  of  determining  optimum  conditions  for  a desired  response,  a 
means  of  predicting  values  of  the  response  for  specific  factor  combinations, 
and  a means  of  determining  the  relative  importance  of  the  factors  investigated. 

The  Response  Surface 

In  general,  the  objective  of  research  is  to  determine  the  relationships 
between  a response  and  associated  factors  in  a restricted  region  bounded  by 
convenient  values  of  the  factors.  In  the  statistical  approach  to  research, 
the  response  under  consideration  and  the  factors  chosen  for  investigation  may 
be  considered  as  variables  which  determine  a hypothetical  and  mathematical 
space.  Suppose  the  problem  is  to  define  the  relationship  between  the  amount 
of  manganese  sulfate  remaining  in  solution  and  the  amount  precipitated,  as 
affected  by  the  temperature  and  pH  of  the  solution.  The  manganese  sulfate 
remaining  in  solution,  the  solution  pH,  and  the  solution  temperature  may  be 
represented  by  axes  at  right  angles  as  shown  in  figure  1. 
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FIGURE  1.  - The  Geometrical  Representation  of  a Response  and  Two  Factors. 


The  axes  representing  manganese  sulfate  in  solution,  solution  pH,  and 
solution  temperature  in  figure  1 define  a hypothetical  three-dimensional  space. 
If  another  factor,  such  as  pressure,  were  investigated  for  its  effect  upon  the 
amount  of  manganese  sulfate  in  solution,  four  axes  would  be  necessary  for  a 
similar  representation,  and  the  problem  could  not  be  viewed  geometrically. 
However,  in  the  latter  case  it  is  convenient  to  think  of  the  problem  as  exist- 
ing in  a hypothetical  four-dimensional  space.  The  postulation  of  such  mathe- 
matical spaces  provides  a theoretical  setting  within  which  the  development  of 
mathematical  expressions  for  empirical  and  fundamental  relationships  is  made 
easier.  The  use  of  statistical  principles  within  such  settings  facilitates 
the  development  of  the  desired  equations  which  represent  the  relationships 
being  investigated. 

If  values  of  manganese  sulfate  in  solution  were  plotted  against  values  of 
solution  pH  and  solution  tenperature  in  figure  1,  a curved  surface  would  be 
obtained.  Such  a geometrical  or  hypothetical  representation  of  the  relation- 
ship between  a response  and  the  associated  factors  is  called  a response  sur- 
face. In  a problem  involving  two  factors,  Xa  and  Xb,  the  response  also  may  be 
visualized  as  contour  lines  on  a plane  fixed  by  the  axes  representing  the  two 
factors.  This  is  illustrated  in  figure  2.  The  contour  lines  with  successive 
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values  of  50,  60, 
70,  and  80  percent 
represent  hypothet- 
ical recoveries  of 
manganese  due  to 
the  factors  Xa  and 
Xjj  in  a flotation 
process.  In  a 
problem  involving 
three  factors,  Xa, 
Xj,,  and  Xf.,  the 
relationships  can 
be  visualized  geo- 
metrically by 
using  three  axes 
for  the  three  fac- 
tors and  represent- 
ing the  response 
by  means  of  con- 
tour lines  on 
planes  perpendicu- 
lar to  one  of  the 
factor  x0  axes  representing 

FIGURE  2.  - A Contour  Surface  in  a Two-Factor  Space.  the  factors.  Ihis 

is  illustrated  in 

figure  3.  When  contour  lines  are  used,  each  contour  line  is  appropriately 
labeled  with  the  response  value  it  represents.  An  advantage  of  contour  repre- 
sentation is  that  attention  is  focused  on  the  levels  of  the  factors,  and  the 
response  can  be  imagined  without  the  distraction  of  another  dimension  for  the 
response.  In  the  case  of  four  or  more  factors,  geometrical  representation  is 
not  possible,  and  mathematical  representation  of  th-  hypothetical  response 
surface  must  be  relied  upon. 

If  there  are  p factors  in  an  experiment,  the  mathematical  existence  of  a 
p-dimensional  space  corresponding  to  the  p factors  is  implied.  This  mathemati- 
cal space  is  called  the  factor  space.  Within  the  factor  space  there  usually  is 
an  approximately  defined  region  corresponding  to  factor  combinations  of  poten- 
tial interest.  This  region  is  defined  by  the  upper  and  lower  limits  of  the 
factors  which  are  of  interest  to  the  research  worker.  This  region  is  called 
the  experimental  region  and  is  the  region  of  the  factor  space  within  which  the 
research  work  will  be  confined,  although  there  may  not  be  any  single  experi- 
ment that  will  span  the  whole  region. 

Polynomial  Representation 

Theoretically  any  response  surface  can  be  represented  by  some  mathematical 
expression.  The  proper  combination  of  the  factors  that  produces  various 
levels  of  the  response  may  be  represented  by  a mathematical  expression  vrtiich 
is  known  as  the  response  function.  In  this  paper  a factor  is  represented  by 
the  letter  X with  appropriate  letter  subscripts.  The  general  expression  for 
the  response  obtained  in  an  experiment  is  the  letter  y.  This  response  is 
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FIGURE  3.  - Contour  Surfaces  in  a Three-Factor  Space. 


statistically  considered  to  be  an  estimate  of  the  true  response,  which  is 
represented  by  the  letter  Y.  The  true  response  may  be  considered  as  the 
response  that  could  be  determined  if  an  infinite  number  of  experiments  could 
be  and  were  made  for  that  purpose.  If  Xa,  Xb,  Xk  are  the  factors,  the 

relationship  between  the  true  response  and  the  factors  can  be  briefly 
expressed  as  follows: 

• Y = U(Xa,  Xb,  . ..,  Xk).  (1) 

Equation  (1)  says  that  Y is  a function,  U,  of  the  factors  Xa,  Xb,  ...,  Xk. 

The  function  U is  called  the  true  response  function.  The  estimated  response 
function  is  represented  by  the  letter  u and  the  relationship  between  the  esti- 
mated response,  y,  and  the  estimated  response  function  may  be  briefly 
expressed  as  follows: 

y = u(Xa,  Xb,  •••»  Xk).  (2) 

The  true  response  function,  U,  may  represent  for  a one-factor  problem  an 
exponential  relationship  such  as  that  shown  in  figure  4 which  is  represented 
by  the  equation, 

Y = U(Xa)  = Me"xa,  (3) 

M = a constant. 
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When  the  true  response  can  be  represented  by  a smooth  continuous  curve  within 
the  experimental  region  such  as  shown  in  figure  4,  the  function  which  the 
curve  represents  may  be  expressed  in  the  equivalent  form  of  a polynomial  with 
an  infinite  number  of  terms;  namely, 

Y = U(Xa)  = Mq  + MaXa  + MaaX|  + MaaaX^  + ...»  (4) 

Mq,  Ma,  Maa,  ...  = suitable  constants. 

For  practical  purposes  the  true  response  can  be  represented  with  an  abbrevi- 
ated number  of  terms  taken  from  the  first  part  of  the  series  as  follows: 

Y = U(Xa)  = M0  + MaXa  + MaaX|  + ...  + Maa><>axJ.  (5) 

The  number  of  terms  necessary  to  adequately  fit  the  true  response  in 
equation  (5)  depends  upon  the  range  of  the  values  of  Xa.  For  a sufficiently 
narrow  range  not  including  a turning  point  in  the  curve,  a good  approximation 
is  supplied  by  the  straight  line, 

Y = Mq  + MaXa.  (6) 

For  two  factors  Xa  and  Xb  the  approximating  polynomial  equation  would  be 

1 - Mo  + MA  + Mbxb  + * Mbb*i  + «.bVb  + "a..*3, 

+ »bbb^ + ••• + + "bb.^b^-  <?> 

As  in  the  case  of  a single  factor  if  the  region  of  the  response  surface  to  be 

represented  is  small  and  not  near  a turning  point,  then  the  terms  of  degree  1 

might  be  sufficient  to  represent  the  region.  In  this  case  the  equation  would 
be 

(8) 

Equation  (8)  is  the 
equation  of  a plane  in 
which  Mg  represents 
the  slope  of  the  plane 
in  the  direction  of 
the  factor  Xa,  and  Mj, 
represents  the  slope 
of  the  plane  in  the 
direction  of  the  fac- 
tor Xfc.  For  la.ger 
regions  or  for  small 
regions  near  a turning 
point,  terms  of  higher 
degree  would  have  to  be 
used  in  order  to  take 
into  account  the  curva- 
ture of  the  response 
surface. 


Y = M0  + MaXa  + MbXb. 


FACTOR  X0 


FIGURE  4.  - Graph  of  Response  Y for  Factor  Xn  for  the 
Exponential  Relationship,  Y ^Me'^a. 
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Occasionally  it  is  possible  to  transform  a complex  mathematical  relation- 
ship for  a specific  problem  into  a simplified  form  \dien  prior  research  has 
indicated  that  functions  of  the  dependent  or  independent  variables  can  be  used 
as  the  response  or  factors,  respectively,  in  the  design  and  analysis  of  an 
experiment.  The  use  of  a function  of  an  independent  variable  is  illustrated 
by  the  hypothetical  equation. 


Y = Me"T.  (9) 

* 

Suppose  that  Y is  the  amount  of  manganese  sulfate  remaining  in  solution  after 
thermal  precipitation  in  an  autoclave,  that  T is  temperature  ° C. , and  that 
equation  (9)  has  been  established  from  prior  research.  Let  it  be  assumed  that 
further  research  is  being  made  to  investigate  the  relationship  between  Y,  T, 
and  another  factor  which  is  solution  pH.  In  the  investigation  it  would  be 
reasonable  and  also  helpful  to  assume  the  equation,  Y = U(Xa,  X^) , wherein 
Xa  = e"T  rather  than  Xa  = T.  The  variable  e“T  would  be  used  as  a factor  in 
both  the  design  and  analysis  of  the  experiments  conducted.  In  the  actual  con- 
duct of  the  experiments,  the  values  of  T corresponding  to  the  values  of  e"T  in 
the  designs  would  be  used.  The  advantage  of  this  approach  is  that,  in  general, 
it  makes  possible  a closer  approximation  to  the  response  surface  with  a smaller 
number  of  terms  than  if  temperature  itself  were  used  as  a factor.  The  fewer 
the  terms  in  the  response  function,  the  smaller  will  be  the  amount  of  experi- 
mental work  needed  to  estimate  the  values  of  the  coefficients,  M^,  such  as 
those  in  equation  (7). 

An  example  of  the  advantageous  use  of  a function  of  the  dependent  vari- 
able for  the  response,  Y,  in  a hypothetical  flotation  experiment  follows: 
Suppose  that  R is  the  recovery  of  manganese  by  flotation,  that  factor  Xa  is 
conditioning  time,  that  factor  Xb  is  temperature  0 C. , and  that  the  product 
variable  XaXb  is  considered  to  be  important  in  the  response  function.  The 
true  response  equation  then  would  be  expressed  as 

R = Mq  + MaXa  + MbXb  + MabXaXb.  (10) 

However,  under  certain  conditions  involving  undesirable  variations  in  the 
experimental  error,  a different  choice  of  response  will  eliminate  the  undesir- 
able variation  in  the  experimental  error  and  also  eliminate  the  necessity  for 
a product  term  in  the  response  function.  In  such  a case  the  true  response 
function  would  be 


Y = Mq  + MaXa  + MbXb,  (11) 

wherein  Y = f(R)  is  a function  of  the  recovery  which  makes  a suitable  choice 
of  response.  The  in  equation  (11)  will  not  be  the  same  as  the  in  equa- 
tion (10).  If  such  information  is  considered  before  designing  an  experiment, 
the  amount  of  work  necessary  to  conclude  the  research  may  be  reduced,  espe- 
cially when  more  than  two  factors  are  involved.  Examples  of  functions  which 
might  be  used  in  transformations  of  the  response  are  logarithms,  exponentials, 
sines,  cosines,  tangents,  reciprocals,  square  roots,  and  so  forth. 


Separation  of  Response  Functions  Into  Independent 
Contributions  of  the  Factors 

A useful  tool  of  statistical  analysis  is  a procedure  for  determining  the 
importance  of  the  contribution  \diich  the  factors  in  a response  function  make 
to  the  overall  variation  in  the  response.  This  procedure  is  possible  because 
a response  function  can  be  separated  into  parts  vdiich  represent  the  total 
independent  contribution  of  each  factor  to  the  response.  The  total  contribu- 
tion of  a factor  to  the  response  is  called  the  effect  qf  the  factor.  The 
effect  of  a factor  may  be  further  divided  into  fractions  \diich  can  be  independ- 
ently associated  with  the  successive  powers  of  the  factor  in  a response  func- 
tion such  as  those  in  equations  (7)  and  (8).  These  fractions  of  the  effects 
are  herein  called  polynomial  components  of  the  effects.  Polynomial  components 
also  may  be  statistically  analyzed  to  determine  the  importance  of  the  varia- 
tion in  response  that  may  be  associated  with  specific  powers  of  the  factors 
included  in  the  response  function.  The  latter  procedure  is  helpful  in  deter- 
mining the  nature  of  the  designs  that  are  necessary  to  conclude  additional 
investigations  as  quickly  as  possible  with  the  greatest  possible  benefit  in 
the  form  of  comprehensive  information. 

In  any  experiment  the  evaluation  of  the  influence  of  a factor  upon  the 
response  requires  the  selection  of  some  standard  with  which  the  response  at 
individual  levels  of  the  factor  can  be  compared.  In  a simple  one-factor 
experiment  at  two  levels,  a customary  comparison  is  the  comparison  of  the 
response  at  the  higher  level  with  that  at  the  lower  level.  For  example,  if 
the  flotation  recovery  of  zinc  using  2 minutes'  conditioning  time  is  85  per- 
cent and  the  recovery  using  3 minutes'  conditioning  time  is  89  percent,  the 
metallurgist  states  that  the  effect  of  increasing  the  conditioning  time  1 min- 
ute is  to  increase  the  zinc  recovery  by  4 percent.  The  response  at  the  lower 
level  serves  as  a standard  of  comparison  for  the  response  at  the  higher  level. 
If  an  additional  test  were  made  for  zinc  recovery  using  4 minutes'  condition- 
ing time,  then  there  would  be  three  possible  comparisons  instead  of  one.  For 
design  and  analysis,  it  is  convenient  to  have  a single  standard  against  which 
the  responses  at  all  levels  of  all  factors  can  be  compared.  In  statistical 
methods  the  standard  which  has  been  adopted  for  ease  and  universality  of  ref- 
erence is  the  mean  value  of  the  responses  obtained  for  every  test  in  an 
exper iment . 

In  this  report  the  response  obtained  for  an  individual  test  is  repre- 
sented by  the  letter  y with  appropriate  subscripts  representing  the  levels  of 
the  factors  and  the  particular  replication  of  the  test.  For  example,  let  the 
conditioning  time  in  the  zinc  flotation  work  be  factor  Xa,  with  levels  k = 1, 

2,  3 corresponding  to  2,  3,  and  4 minutes  of  conditioning  time,  respectively. 
Let  the  replications  of  the  experiment  be  r = 1,  2.  The  zinc  recovery  for  4 
minutes'  conditioning  time  and  the  second  replicate  is  represented  by  y3s. 

In  general,  the  zinc  recovery  for  the  k'th  level  of  conditioning  time  and  the 
r'th  replicate  is  represented  by  y^,..  A bar  above  a symbol  represents  a mean 
value  of  the  quantity  represented  by  the  symbol.  The  mean  value  of  the  two 
replicate  flotation  tests  with  4 minutes'  conditioning  time  is  represented  by 
y3 , and  the  mean  value  for  the  k'th  level  is  represented  by  y^.  The  mean 
value  of  all  observations  in  the  experiment  is  represented  by  y.  The 
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difference  between  the  average  zinc  recoveries  at  4 minutes'  conditioning  time 
and  the  average  of  all  the  zinc  recoveries  is  called  the  effect  of  condition- 
ing time  at  the  third  level  and  is  represented  by  A3.  The  effect  of  condi- 
tioning time  at  the  third  level  is  calculated  as  follows: 


In  general, 


Ag=y3"y* 

Ak  = yk  - y. 


(12) 

(13) 


The  letter  A in  the  symbol  Ak  is  the  capital  of  the  letter  a in  the  subscript 
of  the  symbol  Xa  representing  conditioning  time.  With  three  levels  of  condi- 
tioning time  the  effects  A^  can  be  divided  into  linear  components  and  quad- 
ratic components  represented  by  Ak  and  A^,  respectively.  This  correlation  of 

symbols  for  effects  and  symbols  for  factors  is  followed  throughout  this  report. 


The  assumed  true  response  function  for  the  experiment  is 


M + MX  + M X^ . 
o a a aa  a 


(14) 


The  quantities  Ak  and  A^  represent  the  components  of  the  effect  of  factor  Xa 
at  the  k'th  level  associated  with  the  first  and  second  powers  of  Xa  in  equa- 
tion (14). 


FIGURE  5.  • Recovery  Versus  Conditioning  Time  in  a 
Three-Level  Experiment. 


In  gen- 
eral, if  there  are 
p levels  of  a fac- 
tor, it  is  possi- 
ble to  divide  the 
effect  of  the  fac- 
tor into  (p  - 1) 
components.  The 
fundamental  nature 
of  polynomial  com- 
ponents and  the 
basic  procedures 
for  calculating 
them  now  will  be 
illustrated  for 
linear,  quadratic, 
and  cubic  compo- 
nents. Let  the 
average  values  of 
the  zinc  recover- 
ies at  2,  3,  and  4 
minutes'  condi- 
tioning time  be, 
respectively,  yx  - 
85  percent,  yg  * 

89  percent,  and 
y3  - 91  percent, 
with  an  average  of 
y = 88-1/3  percent. 


The  determination  of  Independent  estimates  of  the  polynomial  components  is 
accomplished  by  first  shifting  the  factor  origins  to  the  center  of  the  experi- 
ment from  which  the  effects  are  measured.  If  the  algebraic  origin  of  the  fac- 
tor Xa  is  shifted  from  0 to  3 minutes,  the  zinc  recoveries  then  may  be  repre- 
sented as  shown  in  figure  5. 

There  are  three  ways  of  measuring  linear  changes  in  response  due  to 
changes  in  conditioning  time,  and  these  are  indicated  by  the  three  lines  join- 
ing the  points  in  figure  5.  The  measures  of  the  linear  changes  in  response 
are  y3  - y2  , y3  - ys , and  y2  - y1  . From  these  three  measures  of  the  linear 
component  an  average  measure,  A,  of  the  linear  component  in  the  experiment  can 
be  obtained.  The  sum  of  these  three  quantities  is 

(y3  - ?i ) + (?3  - ?a)  + (?a  - ?j ) = 2?3  - 2?i  = 2(?3  - ?i )•  05) 

The  proper  divisor  to  obtain  the  average  measure  of  the  linear  component  needs 
to  be  ascertained.  The  quantity  (y3  - yx ) in  equation  (15)  is  equal  to 
(y3  - y £ ) + (y2  - yx ).  Therefore,  equation  (15)  may  be  written 

2(ys  - ya)  + 2(y2  - yx  ) = 2(y3  - ya  ).  (16) 

From  equation  (16)  the  quantity  2(yg  - yx ) is  seen  to  be  the  sum  of  four  basic 
measures  of  the  linear  component.  Therefore,  the  proper  divisor  is  4 and  the 
average  measure  of  the  linear  component  is 

A = 2iy3„.:  = ( l/2)(y3  - y ) = (1/2)(91  - 85)  = 3.  (17) 

4 

The  average  measure,  A,  of  the  linear  component  of  the  effect  for  4 min- 
utes is  effective  over  the  range  between  the  mean  level  of  3 minutes  and  the 
4-minute  level.  The  linear  component,  Ag , for  the  recovery  y3  at  4 minutes' 
conditioning  time  is  equal  to  the  product  of  the  average  measure.  A,  of  the 
linear  component  times  the  distance,  (4  - 3)  = 1,  over  which  it  is  effective. 
The  linear  component,  Ag , is  therefore 


As 

In  like  manner  the  linear 

- (+1)A  = ( l/2)(y3  - y,)  = 3. 

components  for  yx  and  y2  are 

(18) 

= (-l)A  = ( — 1 / 2 ) ( y3  - yx  ) = -3, 

(19) 

Ap 

* (0)A  = 0. 

(20) 

The  average  measure.  A2,  of  the  quadratic  component  due  to  changes  in 
conditioning  time  can  be  likewise  calculated.  The  quantity  (y3  - yg)  is  the 
total  effect  of  changing  the  conditioning  time  from  3 to  4 minutes.  Hie 
linear  component  for  this  change  is  (4  - 3)A  ■ A,  and  the  difference  between 
the  total  effect  and  the  linear  component  is  a measure  of  the  quadratic 
component;  that  is, 

(?3  “ ya)  ” A “ (?S  ■ ya)  - 0/2)(y3  - y:  ) = (l/2)(y3  - 2y2  + yx  ).  (21) 
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Similarly  a measure  of  the  quadratic  component  is  obtained  for  the  difference 
between  the  recoveries  y3  and  yx  ; namely, 

-[  ( ys  - yx)  - a]  = -[(ys  - yx ) - ( i/2)(y3  - yx)] 

= (l/2)(y3  - 2ys  + y,  ).  (22) 

The  average  measure,  A3,  of  the  quadratic  component  is  the  mean  of  the  right- 
hand  quantities  in  equations  (21)  and  (22);  that  is, 

A3  - ( l/2)(y3  - 2ya  + yx)  = (1/2)[91  - 2(89)  + 85]  = -1.  (23) 

Equation  (23)  may  be  rewritten  as 

A3  = ( l/2)(y3  - 2y3  + yx  ) = ( l/2)[(y3  - ya ) - (ya  - yx  )].  (24) 

From  equation  (24)  it  is  seen  that  the  average  measure  of  the  quadratic  compo- 
nent of  the  effect  estimated  from  three  equally  spaced  adjacent  points  of  an 
experiment  is  equal  to  one-half  the  difference  between  the  two  adjacent  meas- 
ures of  the  linear  component.  In  general,  it  may  be  said  that  the  average 
measure  of  the  quadratic  component  of  the  effect  for  three  equally  spaced 
values  of  the  factor  Xa  is  equal  to  the  difference  between  the  two  adjacent 
basic  measures  of  the  linear  component,  calculated  from  pairs  of  adjacent 
points,  divided  by  the  number  of  factor  units  spanned  by  the  three  points. 

It  now  is  desirable  to  show  the  relationship  between  the  average  measure, 
A3 , of  the  quadratic  component  and  the  quadratic  components  A3 , A3 , and  A|  for 
the  recoveries  , ya  , and  y3 , respectively.  From  equation  (12)  the  effect  of 
conditioning  time  on  the  recovery  y3  is  Ag  = y3  - y.  The  difference  between 
the  effect  Ag  and  the  linear  component,  Ag , must  be  the  quadratic  component, 

A| ; that  is, 

A|  = Ag  - Ag  = (y3  - y)  - ( l/2)(y3  - y, ) 


= v - + h + y3  . 


( l/2)y3  + (1/2)?, 


Similarly, 


= (l/6)y3  - (2/6)ys  + (1/6)^  = ( l/3)( l/2)(y3  - 2y3  + y,  ) 
= ( 1/3) A3  = ( 1/ 3) ( — 1)  = -1/3. 

A?  "4  " \ “ y>  • [C-l/2)(y3  - yx  )] 

- yx  - Yl  + 3s  + ys-  + (l/2)y3  - (l/2)y, 

- (l/3)(l/2)(ya  - 2y3  + yx)  = (1/3)A3  = -1/3; 


- 4 - a*  - <*.  - y)  - 0 - - - — f 

- (-2/ 3)( l/2)(y3  - 2ya  + y,  ) = (-2/3)A3 

- ( — 2/ 3) ( — 1 ) = 2/3. 


yi  + y3  + y3 

3 
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A numerical  check  on  these  calculations  can  be  made  by  adding  the  linear 
and  quadratic  components  for  a particular  recovery  to  the  mean,  y.  The  result 
should  be  equal  to  the  recovery;  that  is, 

yk  = y + Ak  + Al  (28> 

For  the  third  level  of  conditioning  time, 

y3  = y + A3  + A|  = 88-1/3  + 3 + (-1/3)  = 91 

Hie  average  measures,  A and  A2,  of  the  linear  and  quadratic  components,  respec- 
tively, and  these  components  in  terms  of  A and  A2  are  listed  for  further 
reference : 

A = ( l/2)(y3  - yx  ) 

A2  = (l/2)(y3  - 2y2  + yx  ) (29) 


\ = (-l)A 
= (0)A 
A3  = (+1)A 


Af  = (1/3  )A2 
A|  = (-2/3)A2 
A|  = ( 1/ 3)A2 


The  investigation  of  the  linear  and  quadratic  components  for  a three- 
level  experiment  is  relatively  easy  because  of  the  simplicity  of  the  design. 

The  investigation  of  polynomial  components  for  experiments  with  more  than 
three  levels  requires  a different  approach.  However,  the  development  of  the 
average  measures  of  the  polynomial  components  may  be  carried  out  on  the  basis 
of  principles  developed  in  the  discussion  of  the  three-level  zinc  flotation 
experiment. 

Consider  a zinc  flotation  experiment  with  all  the  tests  given  in  the  pre- 
vious experiment  and  with  two  additional  replicate  tests  with  5 minutes'  condi- 
tioning time  at  the  fourth  level  for  which  the  average  zinc  recovery  is  92  per- 
cent; that  is,  yx  = 85;  ys  = 89;  y3  = 91;  y4  = 92;  and  y = 89-1/4.  The 
assumed  true  response  function  for  the  experiment  is 

Y = Mq  + MaXa  + HaaX|  + MaaaX|.  (31) 

The  average  measures.  A,  A2,  and  A3,  of  the  linear,  quadratic,  and  cubic  com- 
ponents, respectively,  now  must  be  calculated  for  the  first,  second,  and  third 
powers  of  Xa  in  equation  (31). 

The  algebraic  origin  of  the  conditioning  time,  Xa,  is  first  shifted  from 
0 to  3-1/2  minutes.  The  zinc  recoveries  then  may  be  represented  as  shown  in 
figure  6.  There  are  six  possible  pairs  of  points  which  give  measures  of  the 
linear  component  due  to  the  changes  in  conditioning  time  shown  in  figure  6. 
These  are 


rnmmmm: 


(y4  - ?3  > + (?3  - ?a ) + (?a  - ?x )» 
(?4  - y3)  + (y3  - ?a)» 


y4  - yx  = (y4  - y3)  ^ 

y4  - ys  = (?4  - ?3)  n 

y4  - y3  = (?4  - ?3). 


y3  - Xi 

y3  - y8 
y2  - ?i 


(y3  - ya ) + <?a  - % ) 
(?3  - ?a>» 

(y3  - yx  )• 


The  equations  (32)  show  that  the  six  pairs  of  points  provide  10  basic  measures, 
yk  - ?k-i  » °f  the  linear  component.  Hie  average  measure.  A,  of  the  linear 

component  is 


<l/10)(3y4  + y3  - ?a  - 3y,  ) 

(1/ 10)[3(92)  + 91  - 89  - 3(85)] 


2-3/10. 


As  previously  defined  the  difference  between  two  adjacent  measures  of  the 
linear  component  provided  a measure  of  the  quadratic  component.  Hierefore, 
any  two  adjacent  linear  component  measures  in  the  group  listed  on  the  left 
side  of  equation  (32)  can  be  used  to  obtain  a measure  of  the  quadratic  effect. 
If  two  adjacent  linear  component  measures  span  unequal  factor  spaces,  suitable 
weighting  must  be  employed.  For  example,  if  the  quadratic  component  were  zero, 
the  measure,  y4  - ya , of  the  linear  component  would  be  twice  as  great  as  the 
adjacent  measure,  ya  - yx  , of  the  linear  component,  whereas  if  the  quadratic 
component  were  not  zero,  the  quantity  (y4  - ya ) - 2(ys  - yx ) would  be  its 
measure.  There  are  four  such  measures  of  the  quadratic  component;  that  is. 


(y4  - y3)  - (y3  - ys) 
2(y4  - y3)  - (y3  - yx ) 

(y4  - ya)  - 2(ys  - yx ) 
(y3  - ya)  - (ys  - y: ) 


(y4  - 2y3  + y2), 

2y4  - 3ya  + ?! 

2(?4  - 2y3  + ys ) + (y3 
?4  - 3ys  + 2yx 
(?4  - 2y3  + ?3)  + 2(y3 
(?3  - 2ya  + yx). 


- 2ya  + ?!  ) , 


- 2ya  + ?!  ) , 


From  equations  (23)  and  (24)  in  the  case  of  the  three-level  zinc  flota- 
tion experiment,  it  was  seen  that  the  quantity  y3  - 2ya  + y from  three 
equally  spaced  adjacent  points  provides  two  basic  measures  of  the  quadratic 
component,  just  as  two  adjacent  points  provide  one  basic  measure  of  the  linear 
component.  The  4 measures  of  the  quadratic  component  on  the  left  side  of  equa- 
tion (34)  provide  8 measures  of  the  quadratic  component  in  the  form 
yk  " 2yk_i  + y^-a  for  a total  of  16  basic  measures  of  the  quadratic  component. 
Hie  sum  of  the  effects  in  (34)  is  4y4  - 4y3  - 4ya  + 4yx  , and  the  average 
measure.  A3,  of  the  quadratic  component  is 

A2  - (l/16)(4y4  - 4y3  - 4ya  + 4yx  ) = (l/4)(y4  - ?3  - ?a  + ?j  ) 

- ( l/4)(92  - 91  - 89  + 85)  = -3/4.  (35) 
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The  discussion  of  equation  set  (34)  reveals  that  the  basic  measures  of 
the  quadratic  component  are  (l/2)(y4  - 2y3  + ys)  and  (l/2)(y3  - 2ya  + yx  ).  In 
general,  for  a three-level  factor  the  average  measure  of  the  quadratic  compo- 
nent is  the  difference  between  the  two  adjacent  basic  measures  of  the  linear 
component  divided  by  the  number  of  factor  units  spanned  by  the  measure  of  the 
quadratic  component.  Similarly,  the  average  measure  of  the  cubic  component, 

A3 , for  four  levels  is  equal  to  the  difference  between  the  two  adjacent  basic 
measures  of  the  quadratic  component  divided  by  the  number  of  factor  units 
spanned  by  the  measure  of  the  cubic  component;  that  is. 


A3  = (l/3)[(l/2)(y4  - 2y3  + ys ) - <l/2)(y3  - 2ys  + yx )] 

= (l/6)(y4  “ 3y3  + 3ys  - yx  ) 

= (l/6)[92  - 3(91)  + 3(89)  - 85)]  = 1/6.  (36) 


FIGURE  6.  - Recovery  Versus  Conditioning  Time  in  o 
Four-Level  Experiment. 


It  now  is  desirable 
to  determine  the  relation- 
ship between  the  polyno- 
mial components  A^,  A^, 
and  Aj^  and  the  correspond- 
ing average  measures  of 
such  components.  A knowl- 
edge of  this  relationship 
makes  it  possible  to  cal- 
culate the  polynomial 
components.  These  poly- 
nomial components  are 
related  to  the  average 
recovery  at  the  k'th  level 
of  conditioning  time  as 
follows : 

?k  " * s Ak  - y + Ak 

+ Afc  + A^.  (37) 

There  are  functions  of  Xg 
that,  \Aien  multiplied  by 
the  average  measures.  A, 

A2  , and  A3  , will  give  the 
corresponding  linear,  quad- 
ratic, and  cubic  components 
shown  in  equation  (37). 

Let  these  functions  be 
zik»  zsk»  an<*  zak  such  that 
the  following  relation- 
ships hold  true: 


l 


t 
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Ak  “ ^k* 

4 = A2zsk»  (38) 

Ak  = A3z3k* 

From  equations  (37)  and  (38)  we  now  obtain 

yk  = y + ^k  + A2zsk  + A3z3k’  <39> 

In  a more  general  way  the  response  function  in  equation  (39)  may  be  written 

y = y + AZj  + A2Z2  + A3Z3.  (40) 

The  determination  of  the  functions  Z1  , Za , and  Z3  can  be  accomplished  by 
an  investigation  of  the  estimated  response  function  for  equation  (31);  that  is, 

y - mo  + maXa  + “aa^  + maaa4*  <41> 

The  first  step  in  the  investigation  is  to  transform  equation  (41)  so  that  it 
is  expressed  in  terms  of  a new  independent  variable  with  its  origin  at  the 
midpoint  of  the  range  of  the  conditioning  time,  Xa.  The  transformation 
Xa  = Xa'  + 7/2  will  make  Xa  = Xa  - 7/2  the  new  independent  variable  with  origin 

at  Xa  = 7/2  as  shown  in  figure  6.  The  substitution  of  Xa  = Xa  + 7/2  in  equa- 
tion (41)  makes  it  possible  to  write  the  response  function  as 

y - Po  + PaXa  + ?aaXa  + ?aaaXa  » <42> 

V = P0  + Pa<Xa  - 7/2)  + Paa<Xa  * 7/2>2  + Paaa^a  “ W3.  <43> 

The  pi  in  equations  (42)  and  (43)  are  functions  of  the  mi  in  equation  (41), 
which  are  determined  when  like  powers  of  Xa  are  collected  after  the  original 
substitution. 

From  equation  (43)  and  the  individual  zinc  recoveries  we  obtain  the 
general  relationship, 

?k  = Po  + Pa(xa  “ 7/2)k  + Paa(xa  " ?/2)k2  + Paaa(xa  “ 7/2)k3.  (44) 

From  equation  (44)  and  the  values  of  (Xa  - 7/2)k  shown  in  figure  6 we  obtain 
yx  - P0  “(3/2)pa  + (9/4)paa  - (27/8)paaa , 

= Po  -(1/2)Pa  + <1/4>Paa  ’ (1/8>Paaa»  (45) 

y3  - P0  + <l/2)Pa  + (l/4)Paa  + (1/8)Paaa» 


y4  = pQ  + (3/2)pa  + (9/4)paa  + (27/8)paaa 
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The  sum  of  the  equations  (45)  is,  £kyk  = 4p0+  (20/4)paa,  and  the  mean,  y,  is 

y - P0  + (5/4)paa.  (46) 

From  equations  (33)  and  (45)  we  obtain 

A = (l/10)(3y4  + y3  - y3  - 3^) 

= ( l/10)[3po  + (9/2)pa  + (27/4)paa  + (81/8)paaa 
+ Po  + Cl/2)pa  + (lM)paa  + (l/8)paaa 

- PQ  + d/2)Pa  - d/4)paa  + d/8)Paaa 

- 3Pq  + (9/2)pa  - (27/4)paa  + (81/8)Paaa] 

= (l/10)[10pa  + (41/2)paaa]  = pa  + (41/20)paaa.  (47) 

In  a similar  way  we  obtain  from  equations  (35),  (36),  and  (45) 

A2  = ( 1/ 4)(y4  - y3  " ?a  + ^ ) = Paa»  (*8) 

A3  = ( l/6)(y4  - 3ya  + 3y2  - yx ) = paaa.  (49) 

From  equations  (43),  (46),  (47),  (48),  and  (49)  we  obtain  the  relationship, 
y = [y  - (5/ 4)A2 ] + [A  - (41/20 )A3](Xa  - 7/2) 

+ A2(Xa  - 7/2)2  + A3(Xa  - 7/2 )3 
= y + A(Xa  - 7/2)  + A2 [ (Xa  - 7/2)2  - 5/4] 

+ A3 [ ( Xa  - 7/2 )3  - (41/20)(Xa  - 7/2)].  (50) 

A comparison  of  equations  (40)  and  (50)  shows  that 
Z,  = (Xa  - 7/2), 

Z8  = (Xa  - 7/2)2  - 5/4,  (51) 

Z3  = (Xa  - 7/2 f - (41/20)(Xa  - 7/2). 

The  functions  Z1  , Z8 , and  Z3  have  been  shown  to  be  polynomials  in  Xg.  For  the 
k'th  level  of  conditioning  time, 

Zlk  = (Xa  - 7/2)k, 

zak  “ <xa  - 7/2>k2  “ 5/4, 
zsk  " <xa  " 7/2)k3  " (41/20)(Xa  - 7/2)k. 


(52) 
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Equations  (52)  and  the  values  of  (Xa  - 7/2)^  from  figure  6 give 

Zn  = "3/2. 

Zia  = -1/2, 

Zxa  - +1/2, 

Zj4  = +3/2, 

Z31  = (-3/2)2  - 5/4  = +1, 

Z88  = (-1/2)2  - 5/4  = -1, 

(53) 

Z23  = (+1/2)2  - 5/4  = -1, 

Za4  = (+3/2)2  - 5/4  = +1, 

Z31  = (-3/2)3  - (41/20)(-3/2)  = -3/10, 

Z33  = (-1/2)3  - (41/20)(-l/2)  = +9/10, 

Z33  = (+1/2)3  - (41/20)(+l/2)  = -9/10, 

Z^  = (+3/2)3  . (4i/20)(+3/2)  = +3/10. 

From  equations  (33),  (35),  (36),  (38),  and  (53)  the  values  of  the  polyno- 
mial components  are  calculated  as 

Aj  - AZll  - (23/10)(-3/2)  = -69/20, 

Ag  = AZ13  = (23/ 10)(-l/2)  = -23/20, 

A3  = A Z13  = (23/10)( 1/2)  = +23/20, 

A4  = AZ14  = (23/10)(3/2)  = +69/20, 

A?  = A2Zal  =■  (-3/ 4)(  1)  = -3/4, 

A2  = A2Zaa  = (-3/ 4)(-l)  = 3/4,  (54) 

A§  - A2Za3  = (-3/ 4)(-l)  = 3/4, 

At  - A2Za4  - ( — 3/ 4) ( 1)  = -3/4, 

A2  - A3Z31  - (l/6)(-3/10)  = -1/20, 


t ~ A3Z3! 

3 - a3z33 

As  " A3Z33 


(l/6)(9/10)  = 3/20, 

( 1/ 6)( —9/ 10)  = -3/20, 
( l/6)(3/10)  = 1/20. 


A numerical  check  on  these  calculations  can  be  made  by  adding  the  linear, 
quadratic,  and  cubic  components  for  a particular  recovery  to  the  mean,  y.  The 
result  should  be  equal  to  the  recovery;  namely, 

y3  = y + As  + A!  + 4 

= 89-1/4  + 23/20  + 3/4  + (-3/20)  = 91. 

The  determination  of  these  polynomial  components  permits  the  examination 
of  their  importance  from  level  to  level  in  an  experiment  and  the  analysis  of 
their  importance  as  compared  to  the  experimental  error  and  to  one  another. 

The  evaluation  of  their  importance  serves  as  a guide  to  further  design  of  • 

experiments. 

Orthogonal  Transformation  of  Response  Functions 

There  is  a mathematical  transformation  of  a response  function  for  experi- 
ments with  evenly  spaced  factor  levels  that  facilitates  the  investigation  of 
factor  effects  and  their  polynomial  components  and  makes  it  possible  to  design 
experiments  so  that  the  investigation  may  be  concluded  with  a minimum  of  work 
in  designs,  experiments,  and  analyses  of  data.  Furthermore,  the  use  of  the 
principles  that  underlie  the  transformation  yields  more  satisfactory,  compre- 
hensive, and  conclusive  information  about  the  relationships  involved  in  many 
research  problems.  The  transformation  is  called  orthogonal  because  it  changes 
the  form  of  the  response  function  so  that  the  terms  of  the  response  function 
form  a set  of  orthogonal  polynomials  in  the  factors  involved  and  the  coeffi- 
cients of  the  terms  form  a set  of  orthogonal  polynomials  in  the  observations 
of  the  experiment.  Orthogonality  is  basically  a mathematical  concept  of  two 
or  more  functions  which  can  be  graphed  as  vectors  crossing  each  other  at  right 
angles.  A discussion  of  orthogonality,  with  examples,  is  presented  next  before 
proceeding  to  the  detailed  development  of  orthogonal  transformations. 

The  development  of  the  polynomial  components  and  the  average  measures  of 
the  polynomial  components  involves  the  fundamental  principles  underlying  the 
orthogonal  transformation  to  be  discussed.  The  zinc  recovery  was  the  response 
and  the  conditioning  time  was  factor  Xa,  with  levels  k = 1,  2,  3,  4 correspond- 
ing to  2,  3,  4,  and  5 minutes.  The  estimated  response  function  for  the  experi- 
ment was  shown  in  equation  (41),  as  follows: 

y = uiq  + maXa  + maaX|  + maaaX%. 

The  transformed  equation  was  shown  in  equation  (40)  to  be 

y = y + AZ1  + A2ZS  + a3z3  , 

A = (l/10)(3y4  + y3  - y3  - 3yx), 

A2  = ( 1/ 4)(y4  - y3  - ys  + yx), 

A3  = (l/6)(y4  - 3y3  + 3ys  - y1),  (55) 

\ = (Xa  - 7/2), 

Zs  = (Xa  - 7/2 )2  - 5/4, 

Z3  = (Xa  - 7/2 )3  - (41/20)(Xa  - 7/2). 
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The  average  measures,  A,  A2,  and  A3  of  the  linear,  quadratic,  and  cubic 
effects,  are  coefficients  of  the  Z polynomials  and  are  hereafter  referred  to 
as  the  linear,  quadratic,  and  cubic  coefficients  of  the  corresponding  polyno- 
mial components  of  the  effect  of  factor  Xfl.  These  coefficients  are  orthogonal 
polynomials  in  the  observations  y^.  The  coefficients  are  called  orthogonal 
because  the  sum  of  the  products  of  the  coefficients  of  the  corresponding 
observations  is  zero.  For  example,  consider  the  coefficients  A2  and  A3  in 
equation  set  (55): 

A2  = ( 1/ 4)y4  - <l/4)y3  - ( l/4)ya  + (1/4)^  , 

A3  = ( l/6)y4  - (3/6)ya  + (3/6)yg  - (1/6)^. 

The  sum  of  the  products  of  the  corresponding  coefficients  of  the  y^  is 

( 1/ 4)  ( 1/ 6)  + (— 1/ 4)(— 3/ 6)  + (-1/4X3/6)  + <l/4)(-l/6)  = 0. 

It  later  will  be  shown  that  the  coefficients  of  the  observations  in  equation 
(55)  are  simple  multiples  of  the  Z polynomials.  The  sum  of  the  products  of 
the  corresponding  coefficients  of  the  y^  is  zero  because  it  contains  the  fac- 
tor £kzakZ3k  = 0.  The  Z^  in  equation  set  (51)  also  are  called  orthogonal  poly- 
nomials because  the  shown  in  equation  set  (53)  have  the  property  that 
£kzikztk  = 0;  i Is  t;  i,  and  t = 1,  2,  3;  for  example, 

£kz2kz3k  = (+D(-3/10)  + (-1X+9/10)  + (-1X-9/10)  + (+l)(+3/10)  = 0. 

It  would  be  undesirable  to  develop  polynomial  components  and  their  coef- 
ficients in  the  laborious  manner  of  the  preceding  section.  Fortunately,  the 
principles  underlying  the  orthogonal  transformation  make  possible  the  develop- 
ment of  a comprehensive  system  for  the  calculation  of  the  orthogonal  coeffi- 
cients and  the  separation  of  the  polynomial  components  with  relative  ease, 
however  complex  the  problem  may  be. 

Transformation  of  One-Factor  Response  Functions 

To  construct  a comprehensive  system  for  experimental  design  and  analysis 
of  the  components  of  complex  response  functions,  it  is  necessary  to  start  from 
simple  and  easily  understood  principles.  These  will  be  developed  by  detailed 
discussion  and  illustration  of  the  orthogonal  transformation  of  one-factor 
response  functions.  For  an  experiment  involving  one  factor,  Xa,  with  evenly 
spaced  levels  k = 1,  . . .,  n and  with  replicates  r = 1,  ...,  q,  the  assumed 
true  response  function  is 

Y = Mc  + MaXa  + MaaX|  + . . . + Maa..#aXg-1.  (56) 

As  previously  discussed,  it  is  convenient  to  use  the  central  values  of  the 
response  and  the  factor  range  in  an  experiment  for  the  evaluation  of  responses 
and  the  measurement  of  effects.  If  w represents  the  spacing  between  levels  of 
factor  Xa,  the  origin  of  factor  Xa  can  be  shifted  to  the  center  of  the  experi- 
ment with  a change  in  factor  units  by  the  transformation 


r 

t 
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_ _ Xa  - Xa 
6 ■ ""  • 

w 


For  the  factor  levels  k = 1,  2,  . . .,  n,  the  center  of  the  n levels  is  -2— ^_i. 
The  relationship  between  z and  k then  will  be 


i n + 1 
zk  - k - 2 ‘ 


and  specifically. 


2i  = " 


zs  = - 


n - 1 
2 * 
n - 3 
2 ’ 


_ n ~ 3 
zn-l  2 ’ 


n - 1 
Zn  2 ’ 

If  n is  odd  the  values  of  z will  be  integers,  and  if  n is  even  the  values  of 
z will  be  odd  multiples  of  1/2. 

This  shift  in  the  factor  origin  makes  it  possible  to  write  the  true 
response  function  in  the  form 

Y = Nq  + NjZ  + Ngza  + . . . + Nn_j  z11-1 . (60) 

The  Ni,  i = 0,  1,  2,  . . . , (n  - 1),  in  equation  (60)  are  determined  by  col- 

lecting terms  with  like  powers  of  z when  the  substitution  Xa  = wz  + Xa  is  made 
in  equation  (56).  If  02  were  the  linear  component  of  the  true  effect  of  Xa 
and  in  general  0iZi  were  the  i'th  polynomial  component  of  the  true  effect  of 
factor  Xa  corresponding  to  the  successive  i'th  powers  of  Xa  in  equation  (56), 
the  true  response  function  may  be  expressed  in  the  form 

Y = 0OZO  + \\  + 0gZ3  + . . . + 0n.1Zn.1.  (61) 

The  separation  of  the  true  effect  of  factor  Xa  into  the  independent  components 
02 Zj  , . . .,  0n_!  Zn_j  can  be  accomplished  if  the  are  orthogonal  polynomials 
in  the  variable  z = (Xa  - Xa)/w  of  equation  (60).  The  condition  of  orthogonal- 
ity of  the  Zi  insures  the  independence  of  the  terms  0iZi.  The  response  func- 
tion in  equation  (61)  is  the  orthogonal  transformation  of  the  response  function 
shown  in  equation  (56). 

The  0^  of  equation  (61)  can  be  estimated  from  the  observations  obtained 
when  the  experiment  is  performed.  The  estimated  response  function  then  will 
be  of  the  form 

y = A°Z0  + AZj  + A2Zg  + . . . + An“x Z^  . (62) 

Equation  (62)  is  similar  in  form  to  equation  (40)  which  was  obtained  in  the 
previous  discussion  of  the  four-level  single-factor  zinc  flotation  experiment. 
The  A*  of  equation  (62)  are  estimates  of  the  0^of  equation  (61)  and  can  be 
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calculated  in  terms  of  the  values  of  Zj  at  the  factor  levels.  In  the  general 
case  of  response  functions  with  the  form 

MpX?  + MjX  + MgX2  + . . . + (63) 

for  n equally  spaced  values,  X=0,  1,  2,  ..  . , (n-1),  the  derivation  of 
orthogonal  polynomials,  P^,  has  been  shown  by  Kendall6  and  is  explicitly  shown 
for  P , Pa , . . . , Pe ; PQ  » 1.  For  the  variable  z in  equation  (60)  the 
orthogonal  polynomials,  Z^,  of  equations  (61)  and  (62)  can  be  calculated  from 
Kendall's  P^  by  the  substitution  X = z + (n  - l)/2.  The  results  are  as 
follows : 


Zo  = 1. 

Z,  = z , 


Zs  = z2 


Z3  = z3 


Z4  = z4 


Z5  = z6 


Z6  = z6 


n2  - 1 
12  » 

T 3n2  - 71 

" L 20  JZ’ 


ran2  - 131,  3(n2  - lXn2  - 9) 

L 14  f 560 

+ [. 


15n4  - 230n2  + 407 


1008 


z. 


SOn2  - 31 
44 


* [' 


5n4  - llOn2  + 3291 
176  J2 


. 5(n2  - l)(n2  - 9)(n2  - 25) 

14784 


(64) 


If  z = (Xfl  - 7/2)  and  n = 4,  the  resulting  polynomials  Z1  , Zs , and  Z3  are 
the  same  as  previously  shown  in  equation  set  (51)  for  the  four-level  zinc  flo- 
tation experiment.  Bennett  and  Franklin6  have  given  a recursion  formula  by 
means  of  which  the  value  of  any  i'th  Z polynomial  may  be  calculated  from  the 
values  of  Z ^ , Z^.g  , and  Zl  . In  the  terminology  of  this  paper  the  formula  is 

-(i-l)2[n2  - (i  - l)2]- 


zi-izi 


. 4 [4(i  - l)2  - lj  J Zi-»* 


(65) 


Since  equations  for  the  Z±  polynomials  have  been  established,  it  now  is 
possible  to  obtain  a general  solution  for  the  A1  in  terms  of  the  corresponding 
Zik  where  the  Zik  are  the  values  of  Z±  at  the  k'th  level.  For  the  k'th  level 
of  factor  Xa  the  general  relationship  of  equation  (62)  becomes 


A°Zok  + AZ1 


+ a2z 


2 k 


+ An-1Z 


(n-$k  * 


(66) 


5Kendall,  M.  G. , The  Advanced  Theory  of  Statistics:  Hafner  Publishing  Co., 
New  York,  N.Y.,  vol.  2,  3d  ed. , 1951,  pp.  159-161. 

6Bennett,  C.  A.,  and  Franklin,  N.  L. , Statistical  Analysis  in  Chemistry  and 
the  Chemical  Industry:  John  Wiley  & Sons,  Inc.,  New  York,  N.Y.,  1954, 
p.  257. 


The  values  of  the  Z^  of  equation  (66)  are  determined  from  equation  set 
(64)  for  the  values  of  z = at  the  k'th  level.  There  are  n equations  of  the 
type  of  equation  (66)  in  n unknowns  A° , A,  A2 , . . . , An_1  . The  A*  can. 


therefore,  be  resolved  in  terms  of  the  Zik,  and  the  resulting  general  formula 
for  the  A1  is 


skzikyk 


^kzlk 


gkrzikykr 
<£kzik  * 


From  equation  set  (64)  ZQ^  = ZQ  = 1 and  the  solution  for  A0  is 


£k(D 


3-  = y- 


From  equation  (68)  the  first  term  in  equation  (62)  is  A°Z0  = y( 1)  = y,  which 


is  the  same  as  the  first  term  in  the  response  function  of  equation  (40)  for 
the  single-factor  zinc  flotation  experiment  previously  discussed.  Equation 
(62)  then  may  be  written 


y = y + AZj  + A2Z2  + . . . + A^Z^  . 


By  means  of  the  relationships  in  equations  (57)  and  (64),  equation  (69)  may  be 
converted  to  the  form 


y = *0  + maXa  + maaK  + * * * + maa. . -a5^-1  * 


The  coefficients  in  equation  (70)  are  estimates  of  the  corresponding  coeffi- 
cients in  the  true  response  function  of  equation  (56). 


The  values  of  the  Z ^ frequently  are  fractional,  and  all  contain  a common 
factor  for  a particular  Z^  and  a particular  value  of  n.  Values  of  Zj£k  other 
than  Zok  have  been  extensively  tabulated  by  Fisher  and  Yates7  for  the  set  of 
smallest  possible  integral  values  of  Zik  such  that  Zik  = WiZik  and  Wi  = a con- 
stant. Substitution  of  Zik  = Zfk/Wi  in  equation  (67)  gives  an  expression  for 
the  A1  in  terms  of  the  zfk!  that  is. 


£kr<zik/wi)ykr/q£k(zik/wi)2  ~ Wiv:krzikykr/c£kzik2 


In  this  report  the  quantities  £krzikykr  an<*  ^kzik2  are  represented  by  and  Qt, 
respectively.  Equation  (71)  then  may  be  written 


A1  = (W1/qQi)^i. 


The  values  of  and  Qi  are  common  to  all  Zfk  f°r  a particular  value  of  n and 
a particular  value  of  i.  Values  of  and  Qi  also  are  tabulated  with  the 
values  of  the  Zfk*  The  values  of  Z{k*  Wi,  and  for  small  values  of  n may  be 
easily  determined  from  equation  set  (64)  by  use  of  the  relationships  discussed 
here.  Values  of  Z^k,  Wi,  and  Qi  for  values  of  n = 2,  3,  4,  and  5 are  shown  in 
table  1. 


Fisher,  R.  A.,  and  Yates,  F. , Statistical  Tables  for  Biological,  Agricultural, 
and  Medical  Research:  Hafner  Publishing  Co.,  Inc.,  New  York,  N.Y. , 1957, 
pp.  90-100. 
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TABLE  1.  - Numerical  values  for  the  mean,  linear,  quadratic. 

cubic,  and  quartic  orthogonal  polynomials 


Design 

H 

2 

■PH 

n 

= 4 

n = 5 

va  lue 

z' 

ok 

z' 

ok 

fSt 

m 

z 9 
^ok 

B9 

z 9 
*k 

Z 9 
Z3k 

Z ' 
ok 

Q3 

z 9 
»k 

z 9 
Z3k 

z;  1 
z*k 

of  z 

Wm 

Wm 

Wm 

Wm 

-2 

wm 

+1 

-2 

■a 

-i 

+1 

+i 

-3 

+i 

BH 

-1 

+1 

+1 

D 

+2 

-4 

+1 

1 

Wm 

+i 

-1 

-i 

+3 

H 

* 0 

+1 

-2 

+1 

-2 

+6 

+1 

+i 

+i 

+1 

-i 

-3 

+1 

+1 

+i 

+1 

+1 

+i 

-1 

-2 

-4 

+i 

+3 

+i 

+1 

+2 

+1 

+2 

+2 

+1 

+1 

Qi 

2 

2 

3 

2 

6 

20 

20 

5 

B 

B 

10 

70 

Wi 

1 

2 

1 

1 

3 

■ 

2 

10/3 

1 

B 

B 

5/6 

35/12 

’^The  factor  level  k = 1,  2,  3,  4,  5 is  counted  from  the  top  for  any  particular 
Values  for  the  mean,  linear,  quadratic,  cubic,  and  quartic  orthogonal 
polynomials  are  Zok>  Vk*  Zg'k»  Zgfc,  and  Z^,  respectively. 


The  for  n = 2 shown  in  table  1 usually  are  not  published  because  of 
the  simplicity  of  a one-factor  experiment  at  two  levels.  The  Z^  for  n = 2 
and  all  Z^  have  been  included  in  table  1 in  order  to  facilitate  the  explana- 
tion of  how  to  separate  the  polynomial  components  when  more  than  one  factor  is 
involved.  The  inclusion  of  these  features  in  the  table  makes  possible  a com- 
prehensive notation  which  is  helpful.  A characteristic  of  the  Z*k  and  the  Zfk 
except  Z0k  and  Z^  for  all  values  of  n is  that 

SkZik  = SkZik  = °*  (73) 

This  characteristic  is  helpful  in  later  developments  and  may  be  observed  for 
the  Z^  in  table  1. 

The  calculation  of  the  polynomial  coefficients  A,  A2,  and  A3  according  to 
equation  (72)  now  will  be  demonstrated  for  the  case  of  the  four- level  zinc 
flotation  experiment  previously  discussed,  and  the  quantities  will  be  compared 
with  those  previously  obtained  and  listed  in  equation  set  (55).  For  this  pur- 
pose equation  (72)  may  be  modified  to  give  a function  in  terms  of  the  mean 
observations  yk  as  follows: 


A1  - (Wi/qQiX^i  = (Wi/qQi^krZ^yj^  = (Wi/qQ^EkZ/^ rykr 

= (Wi/Qi^kZ^Erykr/q)  " <VQi>Zkzikyk*  <74> 


By  means  of  equation  (74)  the  coefficients  A,  A2,  and  A3  then  may  be  expressed 
as 

a - (v^jEfcZ/kPk = (i^/Qi)(zi'^  + V-y- + ^ + 

A2  - (Wa/qa)ZkZ^ltyk  * (VQbXZ^y!  + zaSys  + Zs/3y3  + Za'4y4),  (75) 

A3  - (W./^fekZ,^  = (Wa/Qa)(Zg1y1  + Z3'a92  + Z3'3y3  + Z3'4^4). 

By  using  the  values  of  Z^  , W^,  and  as  shown  in  table  1 for  n = 4,  the 

values  of  A,  A2,  and  A3  in  equation  set  (75)  are  shown  to  be 

A = (2/20)(-3y1  - ys  + y3  + 3y4), 

A2  = (l/4)(yx  - ya  - y3  + y4),  (76) 

A3  = [( 10/3)/20](-y1  + 3ya  - 3y3  + y4). 

A comparison  of  the  quantities  in  equation  set  (76)  with  those  in  equation  set 
(55)  will  show  that  they  are  identical.  It  thus  has  been  demonstrated  that 
the  values  of  the  polynomial  coefficients  calculated  on  the  basis  of  equation 
(72)  are  the  same  as  those  previously  obtained  by  laborious  analysis  of  the 
effects. 

Transformation  of  Two-Factor  Response  Functions 

The  usefulness  of  the  results  obtained  in  the  orthogonal  transformation 
of  a one-factor  response  function  for  a similar  transformation  of  a two-factor 
response  function  now  will  be  demonstrated.  A generalization  then  will  be 
made  for  response  functions  containing  more  than  two  factors. 

Consider  an  experiment  in  the  flotation  of  manganese  in  which  the  true 
manganese  recovery  is  the  true  response,  Y,  and  the  factors  are  conditioning 
time  represented  by  Xa  and  temperature  represented  by  Xjj.  Let  the  number  of 
replicates  be  r = 1,  . . . , q,  and  specifically  let  r = 1,  2.  Let  the  levels 
of  Xa  be  k = 1,  2,  3,  corresponding  to  3,  4,  and  5 minutes' conditioning  time, 
respectively,  and  let  the  levels  of  X^  be  m = 1,  2,  3,  4,  corresponding  to  30° 
40°,  50°,  and  60°  C. , respectively.  The  manganese  recovery  for  5 minutes' 
conditioning  time,  a temperature  of  40°  C. , and  the  first  replicate  of  the 
test  is  denoted  by  y38  . In  general  the  manganese  recovery  for  the  k'th  level 
of  conditioning  time,  the  m'th  level  of  temperature,  and  the  r'th  replicate  is 
ykmr*  ^ie  corresPonding  sum  of  replicate  recoveries  is  yj^  = Er-y^mi- » and  the 
corresponding  average  manganese  recovery  is  yfem*  bet  the  sums  of  replicate 
recoveries  be  as  shown  in  the  array  (77)  and  the  corresponding  average  manga- 
nese recoveries  as  shown  in  the  array  (78). 


yn 

- 154.0 

yia 

- 156.0 

Yia  * 158.0 

Yi4 

- 160.0 

yai 

- 158.0 

Yas 

- 162.0 

yaa  ■ 164.0 

y8* 

- 166.0 

Y31 

- 160.0 

yse 

- 164.0 

Y33  ■ 168.0 

ya* 

- 170.0 

*11 

= 77.0 

*13 

= 78.0 

*13 

= 79.0 

*14 

*81 

» 79.0 

*33 

= 81.0 

*33 

= 82.0 

*34 

*31 

= 80.0 

*33 

= 82.0 

*33 

= 84.0 

*34 

y 

« 80-5/6 

The  assumed  true  response  function  for  the  experiment  is 

Y -Mo  + V.  + Vb  + M.,K  + "bb*!  + Wb  + \bxaxb 
+ MabbXa*!  + “abbbVl  + \,bKh  * \mK*L 
* «..bbbxlxt,-  o»> 

In  equation  (79)  the  product  variables  XflXb,XaX^,  etc.,  are  included  in  order 

to  account  for  the  mutual  influence  which  the  two  factors  have  upon  each  other 
in  affecting  the  manganese  recoveries.  This  mutual  influence  is  called  an 
interaction,  and  the  Interaction  effects  may  be  evaluated  separately  from  the 
main  effects  of  conditioning  time  and  temperature.  If  wa  were  the  spacing 
between  levels  of  conditioning  time  and  w^  were  the  spacing  between  levels  of 
temperature,  the  origin  for  the  two  factors  Xa  and  Xb  can  be  shifted  to  the 
center  of  the  experiment  with  a change  in  factor  units  by  the  transformations 


Xa  - Xa  Xa  - 4 .. 

Za  * — T~  * Xa  - 4, 

xb  - xb  xb  - 45  (80) 

Zfe  ~~  * ~ * 

This  shift  in  factor  origin  makes  it  possible  to  write  the  true  response  func- 
tion in  the  form 

Y * Noo  + *oza  + «oiZb  + + NoS*l  + No34  + Niizazb 

+ \*zazl  + Nrszazb  + \iz\zh  + N33z|zb 

+ Naaz|z|.  (81) 


The  i of  equation  (81)  are  constants  with  subscripts  corresponding  to 
J i 1 

the  powers  of  zfl  and  zg.  As  for  a one-factor  experiment,  equation  (81)  can  b 
orthogonally  transformed  into  the  equation 

Y “ ^00Z00  + ^10Z10  + ®01Z01  + ^30Z30  + ^OaZoa  + ^03ZO3 

+ ^1 1 Z1 1 + ^12Z12  + *13Z13  + Kz31  + «*a3ZSa  + ^3  3 Z3  3 * <82> 


ants  with  subscripts  corresponding  to 
factor  experiment,  equation  (81)  can  be 
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The  Zij  are  orthogonal  polynomials  of  the  i'th  degree  in  za  and  of  the  j'th 
degree  in  zb.  The  Zj.j  are  the  products  of  the  corresponding  Zi  and  Zj  for 
one-factor  transformations.  For  example,  Zx 3 = Z1 Z3  where  Z^  is  the  first 
degree  orthogonal  polynomial  in  Za,  and  Z3  is  the  third  degree  orthogonal 
polynomial  in  Zb«  From  equation  set  (64)  we  see  that 


-a.* 


Z3  * 'b 


- 1 ' [MK- 


Likewise, 


oo 


ZoZo  = (Dd)  = 1- 


(83) 


In  equation  (82)  the  terms  01QZ1O  and  0SoZao  are  the  polynomial  components  of 
the  true  main  effect  of  conditioning  time.  The  terms  0O1  ZQ1  , 0osZoS,  and 
0 3Z0g  are  the  polynomial  components  of  the  true  main  effect  of  temperature. 
The  terms  0 Z11  , . . .,  0S3Za3  are  polynomial  components  of  the  true  interac- 

tion effect  between  conditioning  time  and  temperature. 

Owing  to  the  orthogonality  of  the  Z^j,  the  coefficients  0^  can  be  inde- 
pendently estimated  from  the  values  of  the  manganese  recoveries.  The  esti- 
mated response  equation  will  be  of  the  form 

y = A°BPZ00  + ABPZ,0  + A°BZ01  + A2BPZbo  + A°B2Z02  + A°B?Zo8 

+ ABZ11  + AB2Z12  + AB3Z13  + A3BZ21  + A2B2Z22  + A2B3Z23.  (84) 

The  coefficients  A*-bJ  in  equation  (84)  are  coefficients  of  the  polynomials  Z^j 
and  are  estimates  of  the  coefficients  0^j  in  equation  (82).  In  the  coeffi- 
cients AiBJ  any  letter  with  a zero  superscript  may  be  conveniently  omitted  in 
the  expression  of  the  response  function,  and  the  letter  I is  herein  used  to 
represent  A°BP  or  any  coefficient  in  which  all  letters  have  zero  superscripts. 
Equation  (84)  then  may  be  written  < 


IZ 


30  + AZio  + BZ01  + A2zSo  + B2zoa  + B3  Z03  + ABZH 
+ AB2Z12  + AB3Z13  + A2BZ21  + A2B2Z22  + A2B3Z23. 


(85) 


As  in  the  case  of  a single-factor  experiment  the  A^B^  are  called  coefficients 
of  the  polynomial  components  of  the  effects. 

For  the  k'th  level  of  conditioning  time  and  m'th  level  of  temperature  the 
general  relationship  of  equation  (85)  becomes 

^km  = *Zoo(km)  + ^0(100)  + BZoi  (km)  + A2zao(km)  + BSZoa(km) 

+ B3zoa(km)  + ABZi  1 ( km)  + AB8zia(km)  + AB3zi3(km) 

+ A2BZai(km)  + ASB2zaa(km)  + ^^(km)*  <86> 


(86) 
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When  more  than  one  factor  is  involved  in  an  experiment,  the  subscript 
figures  ij  representing  the  degree  of  the  Zij  polynomials  will  be  outside  the 
parentheses,  and  the  subscript  figures  km  indicating  the  factor  levels  for  the 
values  of  the  Z^^m)  will  be  inside  the  parentheses.  There  are  km  = (3)(4) 

= 12  equations  of  the  type  shown  in  equation  (86),  and  there  are  12  unknowns 
I,  A,  B,  . . . , A2B3 . The  A£bJ  can,  therefore,  be  resolved  in  terms  of  the 
Zij(km)>  and  the  resulting  general  formula  for  the  A^bJ  is 

i j _ ^km?i.1(km)ykm  ^kmr^i  1(km)ykmr 

Ekn?l  j(km)  ^kn^ijKkm)  * 

From  equations  (87)  and  (83)  we  see  that 


I 


o t>o  ^km^oo(km)ykm  ^kn>ykm 

Ekn^c^km)  *2 


(88) 


From  equations  (88),  (84),  and  (83)  we  see  that  A°BPZ  * IZ0  = I = y,  and 
equation  (86)  expresses  each  recovery  as  the  sum  of  the  experiment  mean  recov- 
ery and  the  polynomial  components  of  the  main  effects  and  the  interaction 
effects.  When  the  A^B-J  have  been  determined,  the  relationships  of  equation 
sets  (64)  and  (80)  make  it  possible  to  convert  equation  (85)  to  the  form. 


y “ «b  + maXa  + ®hxb  + “aa^  + “bb*!  + “bbb3^  + mabXaXb 
+ mabbXaX^9  + mabbbXa4  + maab^aXb  + “aabbW 

+ “aabbbW*  <89> 

The  coefficients  in  equation  (89)  are  estimates  of  the  corresponding  coeffi- 
cients in  the  true  response  function  of  equation  (79). 


The  values  of  the  are  the  products  of  the  corresponding  Z^^)  and 

Zj(m)»  an<*  these  are  in  turn  related  to  the  Z^jjj  and  the  Zj(m)  of  table  1 by 
common  factors  W^  and  Wj.  It  is,  therefore,  possible  to  express  the  A^bJ  of 


equation  (87)  in  terms  of  ^j,  Wjj,  and  Q^j,  so  that 

zij*wijzij.  (9°) 

zij(km)  " zi(k)zj(m)  * (91) 

Qij  " EknPi^(km)  “ £kzi?k)sW!jtm)  = ^iQJ » (92> 

Wfj  - WfWj,  (93) 

*ij  " £kmrZij(km)ykmr  - £kmzij(km)£rykmr  * (94) 
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From  equations  (87)  and  (90)  to  (94), 

A j(km) 

[Skm(  2 i j ( 1^) /Wi  j )yjan] /Eian(  Z ^ j ( j^)  / j )3 

' <V«ij>«ta^j(I.)5ta>  (,5) 

= ( j /Q<  j )(Slfnn*Z/  j^]cni)y]f7nr/q)  “ j)^lj  * (96) 

The  calculation  o£  the  values  for  the  W^j,  and  Qjj  can  be  con- 

veniently made  by  a suitable  array  of  the  values  followed  by  orderly  multipli- 
cation. The  necessary  relationships  for  ij  ■ 23  are 

Z23(km)  = Zs7(k)Z3(m)» 

WS3  - waw3, 

Qsa  = ^8^3* 


It  should  be  noted  that  the  values  of  Ws  and  Qg  are  those  associated  with 
the  values  of  Z'(fc)  and  that  the  values  of  W3  and  Q3  are  those  associated  with 
the  values  of  Z3^m^.  A convenient  symbolic  arrangement  of  the  calculation  is 


7 7 

Z3(l) 

z ' 

Z3(l) 

7 > 

Z3(S) 

7 ' 

za(a) 

7 / 

Z3  (3  ) 
z 7 

za(3) 

7 ' 

Z3(4) 

W3 

Wa 

Q3 

% 

(97a) 

7 / 7 / 

Z3(l  )Z3(1) 

za(i  )Z3(a ) 

ZS(1  )Z3(3) 

z'  7 7 

za(i  )z3(4) 

w2w3 

%% 

za(2 )Z3 (l ) 

za(a)z3(a) 

za\a)z3,(3) 

za/(a)z3(4  ) 

(97b) 

za(a)z3(i  ) 

Za,(a)Z3,(a) 

Za(3)Z37(3) 

Z2  (3  )Z3  (4  ) 

The  products  in  the 

array  (97b) 

are  respectively  equal  to 

the  corresponding 

quantities  in  the  following  array: 

7 * 

a 3(1 1 ) 

7 ' 

“8  3(13) 

Z * 

“a  3(13) 

Z * 

*3 (14  ) 

w 

W23 

Qaa 

7 ' 

^a3(ai  ) 

7 ' 

^•a3(aa) 

Z ’ 

4‘83(a3) 

Z ' 

“8 3(34  ) 

(98) 

7 1 

a3 (31  ) 

7 ' 

a 3(38) 

z ' 

“83  (33  ) 

Z ( 

“33 (34  ) 

The  Zgg^jy,,)  i n the  array  (98)  are  the  appropriate  coefficients  for  the  corre- 
sponding values  of  y^  * Eryifmr.  given  in  the  array  (77),  which  must  be  used  to 

calculate  the  component  coefficient  ASB?  according  to  equations  (94)  and  (96). 
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From  table  1 the  actual  data  for  the  calculation  of  the  Z2'3(kin),  W23,  and 
Qgg  were  obtained  and  the  calculation  made  as  shown  in  the  array  99a-b: 


Factor  level 


n 

z' 

1 

2 

3 

4 

Q 

W 

4 

Z3  (m) 

-1 

+3 

-3 

+1 

20 

10/3 

3 

Z ' 

a(k) 

+1 

-2 

+1 

6 

3 

12 

ZS3(km) 

-1 

+3 

-3 

+1 

120 

10 

+2 

-6 

+6 

-2 

-1 

+3 

-3 

+1 

According  to  equation  (94), 


(99a) 


(99b) 


^23  ^kn?2  3(km)Srykmr  ^kmZs3(km)ykm*  (100) 

When  the  Z23^km)  of  the  array  (99b)  are  multiplied  by  the  corresponding  values 
of  y^  in  the  array  (77)  and  the  12  products  are  summed,  the  result  is 

t23  = (-1)( 154)  + ( 3) ( 156)  + (~3)( 158)  + (1)(160) 

+ (2)( 158)  + (-6)( 162)  + (6)( 164)  + (-2)(166) 


+ ( — 1) ( 160)  + ( 3) ( 164)  + (-3)( 168)  + (1)(170)  = -6. 


From  equation  (96)  we  obtain 

A2B3  = (Wa3/qQS3)rf33  = [ 10/ 2(  120)  ] ( — 6)  = -1/4.  (101) 

The  values  of  Z^y^,  j , and  Qjj  were  calculated  for  all  the  terms  of 

equation  (86)  and  are  listed  in  table  2. 

For  the  purpose  of  later  developments  it  is  noted  here  that 

skZij(km)  = £kzi(k)Zj(m)  = Zj(m)skZi(k)  = °*  <102> 

The  relationship  of  equation  (102)  holds  true  for  all  Z^ km)  except 

providing  that  the  summation  is  performed  over  levels  of  a factor  for  which 
i i4  0 in  equation  (102).  This  characteristic  may  be  observed  for  the  Z^y^ 

of  table  2.  The  values  of  the  component  coefficients,  A*B^ , for  all  terms  of 

equation  (86)  were  calculated  and  listed  in  table  3. 


TABLE  3.  - Values  for  the  coefficients  of  the  polynomial  components 
of  effects  for  a two-factor,  three- level  by  four- level, 
hypothetical  manganese  flotation  experiment 


ij 

*» 

^ijWij/qQij 

Component 

coefficients,  A*-B-3 

23 

-6 

( — 6)  ( 10)  / 2(  120) 

-1/4 

22 

2 

(2)(3)/2(24) 

1/8 

21 

2 

(2)(6)/2( 120) 

1/20 

13 

-2 

(— 2)( 10/3)/2(40) 

-1/12 

12 

-2 

(— 2)( l)/2(8) 

-1/8 

11 

14 

(14)(2)/2(40) 

7/20 

03 

0 

0 

0 

02 

-4 

( —4) ( 1) /2( 12) 

-1/6 

01 

80 

( 80)(2)/2(60) 

4/3 

20 

-10 

(-10)(3)/2(24) 

-5/8 

10 

34 

(34)(l)/2(8) 

17/8 

00 

1940 

( 1940)( 1) / 2( 12) 

80-5/6 

The  ij'th  component  of  the  effects  in  equation  (86)  can  be  determined  for 
any  of  the  yy^  by  multiplying  the  corresponding  value  of  Z^y^  by  the  value 

of  the  ij'th  component  coefficient  from  table  3.  The  values  of  Z^j^y^j  are 

determined  from  table  2 by  the  relationship  Zy^^s  = Z^y/y^/W.  For  example, 
A2]*3  = -1/4,  ZS3^1;1  ^ = -1/10,  and  the  quadratic  A by  cubic  B component  of  the 

interaction  effect  between  conditioning  time  and  temperature  for  the  average 
manganese  recovery  y2  3 is 


A2B3Za3(11  } = (-1/4X-1/10)  = 1/40. 

A numerical  check  on  the  calculations  for  the  results  shown  in  table  3 can  be 
made  by  calculating  and  summing  all  of  the  polynomial  components  for  any  mean 
recovery  yicm  and  by  adding  the  sum  to  the  experiment  mean  recovery,  y.  The 
result  should  be  equal  to  the  mean  recovery,  y^m.  For  example,  we  obtain  from 
equation  (86), 

^11  y + ^1  0(11  ) + ASZ30(11  ) + BZ01  (11  ) + ®2Z02(11  ) + ®3Z03(ll  ) 

+ ABZj  !(!!)+  ABP Zj  s J + AB3Z1  3 j ) + a2  BZS1  j ) 

+ *SB%S<1.)  +j'SB»Z2S(ii)  • (103) 

When  the  values  of  A^bJ  and  Zy./^ \ are  substituted  in  equation  (103)  the 
result  is 

ylx  = 80-5/6  + (17/8)(-l)  + (4/3)(-3/2)  + (-5/8)( 1/3)  + (-1/6)(1) 

+ (0)(-3/10)  + (7/20>(3/2)  + ( — 1/ 8) ( — 1 ) + ( - 1/ 12 )( 3/ 10) 

+ (l/20)(-3/6)  + (l/8)(l/3)  + (-1/4X-1/10) 

- 77. 


M 


I 


ft 
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The  value  of  yn  = 77  percent  manganese  recovery  checks  with  the  value 
originally  given  in  the  array  (78). 

Transformation  of  Many-Factor  Response  Functions 

The  orthogonal  transformation  of  the  response  function  to  calculate  the 
polynomial  main  effects  and  interaction  effects  of  an  experiment  with  any  num- 
ber of  factors  having  equally  spaced  levels  is  a simple  extension  of  the  cal- 
culations for  one  and  two  factors.  The  Z',  W,  and  Q for  any  component  effect 
is  the  product  of  the  individual  Z' , W,  and  Q associated  with  the  power  of 
each  factor  involved  in  the  aforesaid  component  effect.  Consider  the  calcu- 
lation of  the  interaction  coefficient  AC?  for  an  experiment  with  four  factors 
with  levels  and  replication  as  follows: 


Factor 

Xa 

xb 

Xc 

Xd 

Replication 

The  coefficient  AC?  is  calculated  as  follows: 


Factor  Levels 


k 

= 

1, 

I 

s 

m 

= 

1, 

> 

t 

n 

= 

1, 

* 

u 

P 

= 

1, 

9 

V 

r 

= 

1, 

9 

q 

(104) 


AC?  = AWEP=  ^ 020W1020/qQ1080, 

^1  oso  ^kmnpZ  l oso(kmnp)^r^kmnpr » 

Zio2o(kmnp)  ~ Zi  (k)Zo(m)Z2(n)Zo(p)  * Zi(k)Z2(n)» 
^020  “ H (k)Wo(m)W2 (n)^o(p)  = Wi(k)^2(n)* 
o2o  = Qi  (k)Qo(m)Qs(n)Qo(p)  = Qi  (k)tQ2(n)v 
= tvQi  (k)Qs(n)* 


(105) 

(106) 

(107) 

(108) 

(109) 


It  should  be  noted  that  in  equation  (105)  the  interaction  coefficient  AC2 
was  first  expressed  as  a product  involving  all  factors  by  the  equivalent 
expression  tJ-lP&TP.  The  subscripts  on  the  quantities  61030>  ^ioso*  ^020* 
and  Qloso  are  the  same  as  the  superscripts  on  the  letters  in  the  expression 

A 1I?C?EP,  and  are  in  the  same  order.  The  successive  numbers  in  the  subscript 
1020  properly  identify  the  one-factor  orthogonal  polynomials  that  must  be 
multiplied  together  to  obtain  the  orthogonal  polynomial  which  the  coefficient 
AC?  modifies.  The  letters  k,  m,  n,  and  p in  the  subscripts  of  equations  (107), 
(108),  and  (109)  serve  to  identify  the  values  of  Z' , W,  and  Q which  must  be 
selected  from  table  1 for  the  respective  values  of  s,  t,  u,  and  v,  the  total 
number  of  levels  per  factor.  This  general  procedure  will  work  for  the  coeffi- 
cient of  any  polynomial  effect,  whatever  the  number  of  factors  may  be. 
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The  Algebraic  Expression  of  Design  Relationships 

The  theoretical  relationship  between  an  observation  in  an  experiment,  the 
effects  of  the  factors,  and  the  errors  involved  in  an  experiment  may  be  con- 
veniently assumed  or  postulated  in  the  form  of  a statistical  model  to  facili- 
tate proper  design  and  subsequent  analysis  of  experimental  data.  A statisti- 
cal model  is  an  algebraic  equation  that  symbolically  indicates  the  theoretical 
division  of  an  observation  into  suitable  components,  including  the  experiment 
mean  and  the  experimental  error.  For  quantitative  factors  the  other  compo- 
nents may  be  either  effects  or  polynomial  components  of  effects.  When  poly- 
nomial components  are  involved,  we  also  have  a response  equation  if  all  of  the 
factors  are  quantitative.  For  qualitative  factors  the  components  other  than 
the  mean  and  the  experimental  error  are  only  the  effects.  The  use  of  models 
provides  a convenient  and  systematic  method  of  evaluating  the  importance  of 
effects  for  both  quantitative  and  qualitative  factors  without  reference  to 
polynomial  components. 

A simple  one-factor  model  may  be  developed  on  the  basis  of  previous  dis- 
cussion on  orthogonal  transformations.  Consider  the  general  case,  previously 
discussed,  of  the  one-factor  experiment  with  the  true  response  function  shown 
in  equation  (56);  namely, 

Y = M0  + MaXa  + MaaX|  + . . . + Maa_aX“-i. 

The  orthogonal  transformation  of  equation  (56)  was  given  in  equation  (61)  as 
follows : 

Y ~ 0OZO  + ^iZi  + ^2Za  + * * * + ^n-i  Zn-i  * 

For  the  purpose  of  developing  the  theory  of  models  let  02ZR  of  equation  (61) 
be  called  *A2  and  in  general  let 

M1  = 0iZi.  (110) 

The  quantities  *A*,  i = 1,  . . . , (n-1),  are,  therefore,  defined  to  be  the 
components  of  the  true  effect,  M,  of  the  factor  Xa;  that  is, 

*A  = *A  + *A2  + . . . + *An-1  . (Ill) 

For  the  k'th  level  of  factor  X„ , 

EL 

*Afc  = *Ak  + *A|  + . . . + *Ag-1 . (112) 

For  the  k'th  level  of  factor  Xa  the  estimated  response  function  for  equation 
(61)  was  shown  in  equation  (66)  as 

yk  * A°zok  + AZik  + A2Z2k  + • • • + An_1  Z(n-i  )k' 

The  term  A°ZQk  in  equation  (66)  is  an  estimate  of  0oZok  ■ *Ak,  and  since 
A°ZQk  ■ y,  y is  an  estimate  of  *Ak.  If  y,  is  the  true  mean  response  of  the 
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experiment,  then  y is  an  estimate  of  p,  and  is  the  same  as  ^ If  A^Z^ 

= A^,  equation  (66)  then  may  be  written 

^k=^+Ak+Ak+*  * * + * (U3) 

yk  = y + ik*  (H4) 

= estimate  of  *a£, 

A^  = estimate  of 

In  equation  (114)  y^  is  an  estimate  of  the  true  mean,  at  the  k'th 
level  of  factor  Xa.  Equation  (114)  is  an  estimate  of  the  equation 

M>k  = M-  + ( 115) 

Equation  (115)  is  equivalent  to  the  equation 

ykr  = u + *— k + *er(k)‘  <116> 

*er(k)  = ykr  " M-k’ 

Equation  (116)  is  the  statistical  model  for  a one-factor  experiment.  The 
quantity  *er(k)  is  the  true  error  in  the  observation  ykr.  The  replicate  num- 
ber represented  by  the  letter  r is  placed  last  in  the  subscript  of  the  obser- 
vation, ykr,  but  is  placed  first  in  the  subscript  of  the  error  term,  *er(k), 
whereas  the  factor  level  is  placed  in  parentheses  in  the  subscript  of  the 
error  term.  This  order  in  the  subscript  of  experimental  error  is  not  always 
used  in  the  current  literature  but  is  followed  in  this  report. 

Equation  (116)  was  developed  for  the  general  case  of  an  experiment  in 
which  the  factor  was  quantitative  and  the  effect  of  the  factor  could  have 
polynomial  components.  On  the  basis  of  equation  (112)  the  model  could  be 
expanded  as  follows: 

“ * **k  + K * • ■ • * **T  + *er(k)'  <U7> 

Although  equation  (116)  was  developed  for  a quantitative  factor,  the  same  equa- 
tion may  be  used  for  a model  of  an  experiment  with  one  qualitative  factor. 
However,  there  would  not  be  any  polynomial  components,  and  the  model  could  not 
be  expanded  to  the  form  of  equation  (117).  In  the  case  of  both  quantitative 
and  qualitative  factors,  equation  (116)  is  a proper  expression  of  the  model, 
and  the  true  main  effect  *Ak  is  the  difference  between  the  true  experiment 
mean  and  the  true  mean  at  the  k'th  level,  that  is,  from  equation  (115), 

" u*  (118) 

The  model  for  a two-factor  experiment  may  be  developed  on  the  basis  of 
previous  discussion  on  orthogonal  transformation.  Consider  the  two-factor 
manganese  experiment,  previously  discussed,  in  which  conditioning  time  was 


*k 
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factor  Xa  with  levels  k = 1,  2,  3;  temperature  was  factor  Xj,  with  levels 
m = 1,  2,  3,  4;  and  replication  was  r * 1,  2.  The  true  response  function  was 
given  in  equation  (79)  as  follows: 


Y “ Mo  4 MaXa  + MbXb  + *aa*k  + Mbb^  + MbbbXb  + MabXaXb 

+ *abb*a^  + *abbb*a*i  + *aab*L*b  + *aabb*!ft 
+ Maabbb^^* 

The  orthogonal  transformation  of  equation  (79)  was  shown  in  equation  (82)  as 

Y “ ^00Z00  + 0Z1  0 + ^01  Z01  + ^80^20  + ^0SZ0?  + ^03Z03 

+ ^11Z11  + ^la^ia  + ^13Z13  + ^21Z2l  + ^23Z22  + ^23Z23’ 

The  term  023Za3  from  equation  (82)  is  called  *ASB3 , and  in  general, 

= flijZij.  (119) 

According  to  equation  (119),  the  quantities  *A*  = *AiB° , i = 1,  2 are  defined 
as  the  components  of  the  true  main  effect  of  conditioning  time,  the  quantities 
*BJ  = *A°BJ,  j = 1,  2,  3 are  defined  as  the  components  of  the  true  main  effect 
of  temperature,  and  the  quantities  *AiB j , i,  j £ 0 are  defined  as  the  compo- 
nents of  the  true  interaction  effect  of  conditioning  time  and  temperature. 

Let  the  true  main  effect  of  conditioning  time,  the  true  main  effect  of 
temperature,  and  the  true  interaction  effect  of  the  two  factors  be  *A,  *B,  and 
*AB.  respectively.  For  the  k'th  level  of  conditioning  time  and  the  m'th  level 
of  temperature  the  following  relationships  then  hold: 

*A^  m *Aj^  + *A^ , 

*§m  * *Bm  + *®m  + *®m»  ( 120) 

- (*AB)jan  ♦ + (*AB»>km  ♦ (*A2B)kin, 

* <*A3B“)km  ♦ 

For  the  k'th  level  of  conditioning  time  and  the  m'th  level  of  temperature,  the 
estimated  response  function  for  equation  (82)  was  given  in  the  form  shown  in 
equation  (86);  namely, 

^km  " *Zoo(km)  + o(km)  + BZoi(km)  + A2Zso(km)  + B2zos(km) 

+ B3Zos(km)  + ABZii(km)  + A®2Zi2(km)  + AB3zi3(km) 

+ A3BZSi(km)  * ASB2Z22(km)  + A2B3z23(km)- 
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Let  the  nomenclature  for  the  terms  in  equation  (86)  be  changed  as  follows: 

Alzio(km)  = Ak» 

“No®)  " B«’  <m> 

AlBJziJ(km)  ' Wl»JW 

By  means  of  the  equalities  in  equation  set  (121)  and  the  fact  that 
IZ00(km)  = equation  (86)  may  be  written 

ykm  = y + Ak  + Afc  + Bm  + Bm  + + (^km  + <ab2  >km 

+ (AB3)^  + <A2B)km  + (A2B2)km  + (A8*3)^  (122) 

(123) 


m 


Bm  + ABkm* 

estimate 

of 

*Ak« 

estimate 

of 

*Bi. 

estimate 

of 

<*AiBj>km> 

estimate 

of 

*^k’ 

estimate 

of 

*B  , 

— m 

estimate 

of 

*AB.  . 

— km 

estimate 

of 

the  true  mean 

—km 


and  m.  Equation  (123)  is  an  estimate  of  the  equation, 

^km  = ^ + *4  + *An  + *— km’ 

*Ak  = M-k  " M-» 

*B 


(124' 


»*m  " 


*AB, 
— km 


^km  ‘ ^ + *^k  + = ^km  " ^k  ' ^m  + »*' 

Equation  (124)  is  equivalent  to  the  equation, 

y 

kmr 

*er(km)  = ykmr  “ ^km* 


u + *A  + *B  + *AB  + *e  „ . 

~ k — m km  r(km) , 


( 125] 


Equation  (125)  is  the  statistical  model  for  a two-factor  experiment  with  fac- 
tors Xa  and  Xb.  The  quantity  *er(ian)  is  the  true  error  in  the  observation 
:mr< 


Because  conditioning  time  and  temperature  were  quantitative  factors  with 
three  and  four  levels,  respectively,  the  model  could  be  expanded  on  the  basis 
of  equation  set  (120)  as  follows: 

ykmr  = H + *Ak  + *A=  + *Bm  + + *B2  + (*AB)to 

+ (*AB2  )km  + (*AB3  )km  + (*A2B)km  + (*ASB2  )km 

+ (*A2B3)km  + *er(km)*  (126) 

However,  if  one  or  both  of  the  factors  had  been  qualitative,  equation  (126) 
would  not  be,  but  equation  (125)  would  be,  a proper  expression  for  the  model. 

If  one  factor  was  quantitative  and  one  factor  was  qualitative,  the  model 
of  equation  (125)  could  be  expanded  for  components  of  the  quantitative  factor. 
It  is  possible  for  such  components  to  vary  from  level  to  level  of  the  qualita- 
tive factor  and  to  produce  interactions  between  the  components  and  the  main 
effect  of  the  qualitative  factor.  For  example,  suppose  that  factor  Xa  was 
quantitative  and  factor  X^,  was  qualitative.  In  conformity  with  previous  nota- 
tion the  true  interaction  of  the  linear  component  of  with  *B  for  the  levels 
k and  m would  be  designated  by  (*AB)km.  The  true  interaction  between  the 
quadratic  component  of  and  the  main  effect  for  the  levels  k and  m would 
be  (^B)^.  The  relationship  of  these  quantities  is 

*— Bkm  - (*AB)km  + (*A2B)km.  (127) 

With  the  notation  of  equation  (127)  the  model  of  equation  (125)  could  be 
expanded  to 

ykmr  " * + *Ak  + + + (*^}km  + (*A2-S)km  + *er(km)*  ( 128) 

Since  the  notation  of  the  effects  and  their  components  for  interactions 
can  be  rather  cumbersome  with  increased  number  of  factors,  the  letter  *1  with 
a subscript  indicating  the  factor  levels  involved  has  been  adopted  for  the 
general  expression  of  some  models  in  this  report.  This  notation  can  be  used 
for  all  of  the  unexpanded  models  such  as  the  two-factor  model  of  equation 
(125);  that  is, 


ykmr  = » + ^k  + *An  + *1km  + ^(km)*  (129) 

When  all  of  the  factors  are  quantitative  as  in  the  case  of  the  two-factor 
model  of  equation  (126),  the  general  expanded  model  can  be  written 

y,  =u+*A1+*A2+*B  + *32  + *8®  + *1“  + *l}2 

kmr  K k K m m m km  km 

+ ‘Si  + + + (130) 

The  meaning  of  the  notations  in  equation  (130)  is  illustrated  by  the  meaning 
of  Ij^,  namely,  1^  is  the  quadratic  A and  the  linear  B interaction  at  the 
k'th  level  of  factor  Xa  and  the  m'th  level  of  factor  Xb.  When  some,  but  not 


all,  of  the  factors  are  qualitative  as  in  the  case  of  the  model  in  equation 

(128) ,  the  expanded  model  can  be  vritten 

'tar  - •*  + * *4  * *£*  * *Ilto  + HS  * *er(km)'  <13I> 

In  equation  (131)  the  letter  m in  the  superscript  of  *1^®  indicates  that  the 
main  effect  *Bm  is  involved  in  the  interaction  rather  than  a component  of  *Bm. 

For  most  purposes  of  experiment  design  it  is  adequate  to  express  a statis 
tical  model  in  the  shorter  form  similar  to  equations  (116),  (125),  and  (129). 
An  appreciation  of  the  possible  importance  of  some  of  the  polynomial  compo- 
nents is  sufficient  to  insure  consideration  of  an  adequate  number  of  factor 
levels  in  a design  to  investigate  such  components. 

An  important  feature  of  a model  such  as  (130)  for  which  the  factors  are 
quantitative  is  that  the  summation  of  any  polynomial  component  over  all  design 
levels  of  any  factor  involved  in  the  component  is  zero.  This  feature  is  illus 
trated  in  the  case  of  the  estimated  effect  1^  for  the  model;  that  is, 

*£  * - A2B?Z23(1-)- 

Em^km  = S:mA2B3Z3  3(km)  = A2 3 ( km)  = (A2B3/W2  3)EmZ33(km)  “ °* 

This  result  is  confirmed  by  equation  (102).  In  general, 

vii-Vta-0-  <132> 

As  a consequence  of  these  relationships  the  summation  of  any  effect  for  such  a 
model  over  all  design  levels  of  a factor  involved  in  the  effect  is  zero.  This 
is  illustrated  for  the  unexpanded  form  of  the  model  as  shown  in  (129);  that  is 

*km  * ^ij-Si’ 

EkXkm  = Em1km  = ^nPij^m  = SijEm1kin  = °*  *133^ 

Results  similar  to  those  of  (133)  are  easily  demonstrated  for  main  effects. 
Although  the  results  were  demonstrated  for  quantitative  factors,  they  also 
apply  to  the  effects  of  qualitative  factors. 

Crossed  Classification 

Models  generally  are  classified  for  convenient  reference  as  being  either 
crossed  or  nested.  Some  models  have  both  of  these  characteristics.  Equation 

(129)  is  an  example  of  a model  with  a two-way  crossed  classification.  Crossed 
classification  exists  when  (1)  each  factor  level  in  an  experiment  occurs  with 
one  or  more  levels  from  each  of  the  other  factors  and  (2)  every  observation  of 
the  response  for  each  test  having  a specific  factor  level  contains  an  effect 
both  constant  and  peculiar  to  that  factor  level. 
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Examples  of  crossed  classified  models  for  three-  and  four-factor  experi- 
ments are  shown  in  equations  (134)  and  (135): 


U + *A,  + *B  + *C  + *1, + *1.  + *1 

H -^k  — m — n km  kn  mn 


+ *^kmn  + *er(kmn)» 


(134) 


y.  = u,  + *A,  + *B  + *C  + *D  + *L  + *L  + *1. 
kmnpr  ^ -*k  —m  — n — p xm  kn  Tip 

+ *Imn  + *Imp  + *Inp  + *Ikmn  + *Xkmp  + *Xknp 

+ *Imnp  + **kmnp  + *er(kmnp)' 

* . t • f ^ . i.  . . 1 i _ a*  i.t_  _ j - ^ j 


(135) 


In  any  model  it  is  possible  that  any  one  or  all  of  the  interaction  effects 
could  be  postulated  to  be  absent  on  the  basis  of  prior  knowledge.  All  of  the 
common  statistical  models  may  be  considered  either  as  examples  of  crossed 
classification  or  as  special  cases  of  crossed  classification  either  with  or 
without  the  occurrence  of  nested  classification.  An  example  of  such  a special 
case  is  the  model  of  equation  (116)  which  may  be  described  as  having  a one-way 
crossed  classification  because  only  one  factor  is  involved. 

Nested  Classification 

The  term  nested  classification  describes  a model  for  a design  or  a frac- 
tion of  a design  in  \Aiich  there  is  a successive  replication  of  a factor  level 
or  levels  associated  with  each  replication  of  some  preceding  factor  level  or 
levels.  For  example,  suppose  that  three  batches  of  ore  were  sampled  from  a 
carload  of  ore,  three  samples  were  taken  from  each  batch,  and  two  chemical 
analyses  were  made  on  each  sample.  Hie  three  batches  of  ore  are  replicates, 
the  nine  samples  are  replicates  of  three  within  batches,  and  the  eighteen 
analyses  are  replicates  of  two  within  samples.  This  is  illustrated  in 
figure  7. 

The  replicate  batches,  samples,  and  analyses  of  figure  7 may  be  numbered 
in  any  possible  order;  for  example,  the  batches  can  be  ordered  xyz  as  shown  in 
figure  7 or  they  could  be  ordered  yzx,  zxy,  yxz , xzy,  or  zyx.  Only  the  sam- 
ples within  a particu- 

Botches  »o  yO  *0  lar  batch  can  be 

ordered;  that  is,  the 

/ \ / \ \ samples  from  batch  y 

/ \ \ could  be  ordered  stu, 

/IV  / 1 V J I V tus,  ust,  sut,  uts,  or 

ompias  po  qO  rO  *0  to  uO  A A A tsu*  Likewise  only 

l\  \ \ \ \ \ \ \ \ the  analyses  on  a par- 

I \ \ \ \ \ \ M M M ticular  sample  can  be 

I \ i \ I \ \ I \ \ \ \ \ ordered;  f°r  example, 

AnoiytM  006000  OOOOOO  000000  the  analyses  for  sam- 
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pie  q could  be  ordered 

FIGURE  7.  - A Desian  With  a Nested  Classification.  cd  or  dc. 
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FIGURE  7.  - A Design  With  a Nested  Classification. 
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In  general,  a peculiarity  of  nesting  Is  that  there  Is  no  correlation 
between  the  occurrence  of  a particular  nuntoered  replicate  of  a factor 
nested  within  a replicate  of  a factor  Xa  and  the  occurrence  of  a replicate  of 
Xj,  with  the  same  number  within  another  replicate  of  Xa,  irrespective  of  the 
order  in  which  the  replicates  may  be  numbered.  Furthermore,  the  replicates  of 
Xfe  having  the  same  number  from  replicate  to  replicate  of  Xa  do  not  have  a con- 
stant and  common  effect.  Under  such  conditions  interactions  do  not  exist 
between  the  factors  Xa  and  X^. 

Assume  that  the  response  for  the  design  of  figure  7 is  percent  lead  in 
the  ore.  Let  batches  be  factor  Xa  with  replicates  i = 1,  2,  and  3.  Let  sam- 
ples be  factor  Xfc  with  replicates  j = 1,  2,  and  3.  The  main  purpose  of  the 
sampling  experiment  is  to  examine  the  errors  due  to  sampling,  and  therefore, 
only  batches  and  samples  are  formally  classified  as  factors.  The  duplication 
of  analyses  is  for  determining  error  due  to  chemical  analysis  and  would  not  be 
necessary  if  the  chemist  already  has  a good  estimate  of  the  error  due  to  ana- 
lytical procedures.  Let  the  replicate  analyses  be  k = 1,  2.  The  percent  lead 
for  the  k'th  replicate  analysis  of  the  j'th  sample  from  the  i'th  batch  is 
represented  by  yijk*  The  mean  value  of  both  replicate  lead  analyses  of  the 
j'th  sample  from  the  i'th  batch  is  represented  by  yij.  The  mean  value  of  all 
the  lead  analyses  in  the  design  is  represented  by  y.  The  difference  between 
these  two  mean  analyses  is  equal  to  the  combined  effect  upon  the  lead  analysis 
of  the  procedures  used  in  selecting  the  batches  and  the  samples.  This  fact  is 
expressed  by  the  equation 


?ij  “ y = &±  + JBj(i).  (136) 

In  equation  (136)  the  subscript  of  indicates  that  the  j'th  replicate 

sample  is  nested  within  the  i'th  replicate  batch. 

Rearrangement  of  equation  (136)  shows  that  the  mean  lead  analysis  yj,  is 
equal  to  the  experiment  mean  analysis  plus  the  effects  due  to  selecting  tne 
batches  and  the  samples;  that  is. 


- y + *!+!]<!)• 

Equation  (137)  is  an  estimate  of  the  true  relationship, 

Ll  = u+  *A+*B„. 

* j * " j ( i) 

*A£  ■ “ M> 

*-j(i)  ” ^ij  " ^i 

Equation  (138)  is  the  equivalent  of  the  expression, 


ijk 


“ + + *£j(«  * **k(ij)' 

*6k( i j ) = yijk  " ^ij 


(137) 


(138) 


(139) 


45 


Equation  (139)  is  a two-factor  statistical  model  with  a nested  classification. 
The  term  Is  read  "the  true  effect  B underline  sub  j within  i"  and  the 

error  term  *ek(ij)  is  read  "e  sub  k within  ij."  In  each  case  of  nested  repli- 
cates, the  factor  replicates  within  which  another  replicate  is  nested  are 
placed  within  parentheses  which  occur  last  in  the  subscript.  The  order  of  the 
factor  replicates  within  the  parentheses  is  the  same  as  their  order  in  the  sub- 
script of  the  observation.  An  important  point  concerning  the  model  shown  in 
equation  (139)  is  that  the  model  is  true  for  any  actual  observation  y^j^,  irre- 
spective of  how  many  other  observations  may  be  obtained.  For  example,  the 
model  is  true  for  the  case  of  an  experiment  with  one  batch,  one  sample  from 
that  batch,  and  only  one  analysis  for  the  sample.  This  illustrates  a charac- 
teristic of  the  relationship  between  models  and  designs;  namely,  it  is  possi- 
ble for  many  designs  to  have  the  same  model. 

The  extension  of  these  ideas  to  cases  in  which  greater  numbers  of  factors 
are  nested  is  easily  made.  With  reference  to  nested  classification  the  term 
rank  is  used  to  indicate  the  position  of  a nested  factor.  The  model  of  equa- 
tion (139)  is  called  a two-rank  nested  model.  Batches  are  the  first  rank,  and 
samples  are  the  second  rank.  Only  primary  factors  under  investigation  are 
referred  to  in  the  classification  of  nested  models.  The  term  rank  originated 
from  the  idea  of  a hierarchical  relationship  between  the  successive  factors  of 
a nested  classification.  In  the  literature  the  terms  hierarchic  and  nested 
are  used  synonymously. 

The  variety  of  ways  in  which  nesting  can  occur  within  a crossed  classifi- 
cation is  considerable,  and  the  proper  expression  of  a model  with  these  two 
characteristics  is  an  aid  both  to  proper  design  and  to  the  ease  of  subsequent 
analysis  of  the  experimental  data.  An  illustration  of  a problem  involving  a 
model  with  both  a crossed  and  nested  classification  is  given  in  the  section  on 
nested  designs. 


Factorial  Designs 


When  experiments  are  designed  statistically,  it  is  possible  to  estimate 
important  effects,  to  obtain  quantitative  measures  of  the  confidence  to  be 
placed  in  the  estimates,  to  compare  the  relative  importance  of  the  various 
effects,  and  to  calculate  empirical  or  theoretical  mathematical  relationships 
between  the  factors  and  the  response.  When  experiments  are  not  statistically 
designed,  attempts  to  evaluate  relatively  large  amounts  of  data  by  statistical 
methods  are  frequently  disappointing.  Sometimes  the  information  contained  in 
the  data  may  be  sketchy  and,  therefore,  only  a few  rough  comparisons  can  be 
made,  or  perhaps  some  important  effects  actually  are  confused  so  that  separate 
estimates  of  these  effects  cannot  be  obtained. 

If  only  the  main  effects  of  several  factors  are  important  and  no  signifi- 
cant interactions  exist,  then  the  classical  method  can  be  used,  in  which  the 
effect  of  one  factor  is  estimated  by  a series  of  tests  in  which  that  one  fac- 
tor is  varied  and  all  others  are  held  constant.  This  method  is  continued 
until  all  factors  have  been  investigated  separately.  If  there  were  a possibil- 
ity of  interaction  effects  between  the  factors,  however,  the  classical 
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approach  would  not  detect  nor  yield  estimates  of  such  effects.  Furthermore, 
when  only  main  effects  are  important,  certain  statistical  designs  may  yield 
the  desired  information  with  fewer  tests  than  are  required  by  a classical 
approach  and  will  provide  additional  information  concerning  the  statistical 
reliability  of  results  obtained. 

The  factorial  design,  one  of  the  more  important  statistical  designs,  is 
one  in  which  several  factors  may  be  studied  at  a specified  number  of  levels 
with  all  possible  combinations  of  the  factor  levels  Included  in  the  design. 

Such  a design  may  be  used  to  investigate  the  effects  of  both  qualitative  and 
quantitative  factors.  The  main  and  interaction  effects  which  involve  only 
quantitative  factors  may  be  analyzed  for  linear,  quadratic,  cubic,  and  higher 
order  components,  depending  upon  the  number  of  levels  chosen  for  these  factors. 
With  factorial  designs  it  is  possible  to  obtain  independent  estimates  of  main 
effects  and  all  possible  interaction  effects  of  the  factors  being  studied. 
Interaction  effects  are  common  between  chemical  and  physical  factors.  For 
example,  the  time  to  reach  maximum  metal  reduction  may  decrease  as  roasting 
temperature  is  increased,  or  the  best  leaching  temperature  at  one  pH  value  may 
not  be  the  best  temperature  at  another  pH  value.  Experience  with  specific 
systems  of  physicochemical  factors,  however,  may  have  demonstrated  that  cer- 
tain factors  do  not  interact,  and  such  experience  should  be  used  in  planning 
the  research  program.  However,  in  the  absence  of  previous  evidence,  it  is 
unwise  to  assume  that  factors  do  not  interact. 

A factorial  design  for  the  manganese  flotation  experiment  in  which  three 
levels,  3,  4,  and  5 minutes,  are  selected  for  conditioning  time  and  four 
levels,  30°,  40°,  50°,  and  60°  C. , are  selected  for  temperature  includes  the 
12  possible  combinations  of  these  factor  levels.  From  the  results  of  an 
experiment  based  on  this  design  it  is  possible  to  estimate  the  linear  and 
quadratic  components  of  the  main  effect  of  conditioning  time;  linear,  quad- 
ratic a d cubic  components  of  the  main  effect  of  temperature;  and  six  compo- 
nent) . the  interaction  effect  between  the  two  factors. 

If  t><.me  of  the  interaction  effects  are  not  important,  it  may  be  possible 
to  use  a fractional  factorial  design  in  which  only  those  tests  which  are  suf- 
ficient for  estimating  the  important  effects  are  selected  from  the  correspond- 
ing full  factorial  design.  Fractional  factorial  designs  have  the  advantage  of 
reducing  the  number  of  tests  to  be  made  and  yet  have  some  of  the  properties  of 
full  factorial  designs.  The  principles  underlying  fractional  factorial 
designs  must  be  understood  before  attempting  to  use  them,  however,  or  else 
Important  effects  may  be  confused  so  that  they  cannot  be  estimated  separately. 

The  following  notation  is  used  to  represent  the  individual  tests  in  a 
design  in  order  to  facilitate  explanation  of  the  properties  of  factorial 
designs.  When  a single  quantitative  factor  Xa  is  to  be  tested  at  several 
levels,  the  test  made  at  the  lowest  level  is  represented  by  aQ , and  the  tests 
made  at  successively  higher  levels  are  represented  by  a1  , a? , a3 , and  so  forth. 
This  system  of  numbering  for  the  subscripts  simplifies  the  later  application  of 
principles  from  mathematical  group  theory  to  determine  the  tests  which  are  to 
be  included  in  a fractional  factorial  design.  It  must  be  remembered  that  the 
response  ya  corresponds  to  test  aQ,  ys  to  ^ , and  so  on.  In  general,  the 
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subscript  for  the  test  symbol  is  one  less  than  that  for  the  corresponding 
response.  Tlie  levels  of  a qualitative  factor  may  be  arranged  in  any  conven- 
ient order  so  as  to  use  this  notation  when  referring  to  the  tests  made. 

The  notation  is  extended  to  experiments  involving  more  than  one  factor. 

In  the  manganese  flotation  experiment,  let  Xa  represent  time  and  Xfe  represent 
temperature,  Hie  letters  a and  b with  appropriate  subscripts  identify  the 
levels  of  factors  Xa  and  X^,  respectively,  in  the  test  symbols  for  this  experi- 
ment. The  test  made  at  the  second  level  of  conditioning  time,  4 minutes,  and 
third  level  of  temperature,  50°  C. , is  represented  by  the  symbol  a1 bg . An 
additional  advantage  of  simplicity  in  notation  is  gained  if  the  letter  corre- 
sponding to  a particular  factor  is  omitted  whenever  that  factor  is  at  its 

lowest  level.  Thus,  the  test  made  for  5 minutes  at  30°  C.  is  represented  by 

ag , the  test  made  for  3 minutes  at  40°  C.  is  represented  by  b:  , etc.  The  test 

made  at  the  lowest  level  of  all  factors  is  represented  by  (1).  This  last 

symbol  is  chosen  for  its  correspondence  with  the  concept  of  an  identity  in 
group  theory  and  further  simplifies  the  procedures  for  determining  the  tests 
to  be  included  in  a fractional  factorial  design.  The  12  tests  in  the  facto- 
rial design  for  the  manganese  flotation  experiment  are  shown  in  table  4 
together  with  the  appropriate  test  symbols. 

TABLE  4.  - Tests  in  a factorial  experiment  to  investigate 
recovery  of  manganese  by  flotation  in  which 
three  levels  of  conditioning  time  and  four 
levels  of  temperature  are  studied 


The  independence  of  estimates  of  the  various  effects  measured  in  a facto- 
rial experiment  depends  upon  the  number  of  times  each  test  in  the  design  is 
made.  Suppose  that  each  of  the  tests  in  the  full  factorial  design  for  the 
manganese  flotation  experiment  were  made  twice  to  obtain  an  estimate  of  experi- 
mental error  as  well  as  estimates  of  the  various  factor  effects.  An  average 
manganese  recovery  for  3 minutes'  conditioning  time  then  can  be  calculated 
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from  the  results  of  eight  tests,  two  tests  made  at  each  of  the  four  levels  of 
temperature.  Similar  average  manganese  recoveries  can  be  calculated  for  4 and 
5 minutes'  conditioning  time.  Since  each  average  is  obtained  from  the  same 
number  of  tests  at  each  of  the  levels  of  temperature,  comparisons  which  are 
independent  of  temperature  can  be  made  between  these  average  recoveries  for 
the  three  levels  of  conditioning  time.  Comparisons  which  are  independent  of 
conditioning  time  also  can  be  made  between  average  recoveries  for  the  four 
levels  of  temperature.  Independent  estimates  of  the  main  effects  of  time  and 
temperature  are  obtained  by  means  of  these  comparisons.  The  data  also  will 
yield  an  estimate  of  the  interaction  effect  between  time  and  temperature  \rfiich 
is  independent  of  both  main  effects.  A design  that  will  give  independent 
estimates  of  all  important  effects  is  said  to  be  orthogonal. 


If  one  of  the  tests  in  a factorial  design  were  not  made,  the  experiment 
would  not  be  orthogonal.  Suppose  the  test  a1  b2  for  4 minutes  at  50°  C.  were 
omitted,  but  the  other  11  tests  in  the  design  were  each  made  once,  then  the 
average  manganese  recovery  for  4 minutes  is  obtained  over  only  three  levels 
of  temperature,  whereas  the  average  recoveries  for  3 and  5 minutes  are 
obtained  over  four  levels  of  temperature.  Also,  the  average  recovery  at 

50°  C.  is  obtained  only  over  two  levels  of  conditioning  time,  whereas  the 

average  recoveries  at  30°,  40°,  and  60°  C.  are  obtained  over  three  levels  of 

time.  Comparisons  between  averages  for  different  levels  of  time  will  not  be 

independent  of  temperature,  nor  will  comparisons  for  different  levels  of 
temperature  be  independent  of  time. 


A similar  situation  arises  if  all  of  the  tests  in  a factorial  design  are 
made,  but  some  of  the  tests  are  not  made  as  many  times  as  the  other  tests. 
Procedures  have  been  devised  for  evaluating  data  from  some  types  of  nonorthog- 
onal  experiments,  but  they  are  more  complex  than  procedures  for  evaluating 
orthogonal  data.  In  some  instances  the  missing  values  can  be  estimated  from 
the  data  available  so  that  the  advantages  of  orthogonality  can  be  used  in 
approximate  methods  of  analysis. 
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There  are  designs  that  are  orthogonal  even  though  all  tests  are  not  made 
the  same  number  of  times.  Consider  a factorial  experiment  to  study  the 
effects  of  two  factors,  Xa  and  Xj,.  The  estimates  of  the  various  effects  will 
be  independent  if  the  number  of  replicates  of  each  test  made  at  any  level  of 
factor  Xa  is  a constant  multiple  of  the  number  of  replicates  of  each  corre- 
sponding test  made  at  any  other  level  of  factor  Xa.  For  example,  suppose 
both  factors  are  studied  at  three  levels.  The  tests  at  the  first  level  of 
factor  Xa  may  each  be  made  once,  those  at  the  second  level  twice,  and  those 
at  the  third  level  three  times.  Similarly,  the  tests  at  the  first,  second, 
and  third  levels  of  factor  Xb  may  each  be  made  twice,  once,  and  three  times, 
respectively.  These  two  ratios  for  the  levels  of  factor  Xa  and  the  levels  of 
factor  Xb  may  be  incorporated  into  a single  experiment.  The  following  array 
gives  the  number  of  replicates  to  be  made  for  each  test  in  the  two-factor, 
three- level  factorial  design: 
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A design  which  includes  proportional  replication  of  the  kind  illustrated  in 
the  array  (140)  is  orthogonal  and  is  useful  when  the  magnitude  of  experimental 
error  varies  from  one  factor  level  to  another.  This  application  also  is  dis- 
cussed in  the  section  on  the  importance  of  experimental  error  in  design  con- 
siderations. The  more  common  design  in  which  all  tests  are  made  the  same  num- 
ber of  times  is  simply  a special  case  of  proportional  replication. 

Two-Level  Factorial  Designs 

Factorial  designs  in  which  two  levels  are  selected  for  each  factor  are 
particularly  applicable  in  the  conduct  of  preliminary  research  and  in  the 
determination  of  optimum  operating  conditions.  Suppose  a standard  procedure 
were  used  to  leach  a copper  ore  at  25  percent  solids  for  30  minutes  at  25°  C. 
The  question  arises  as  to  whether  a better  set  of  leaching  conditions  can  be 
found  for  this  particular  ore.  Three  factors  which  may  be  varied  are  Xa,  per- 
cent solids;  Xjj,  leaching  time;  and  Xc,  leaching  temperature.  For  purposes  of 
illustration,  the  second  levels  for  each  factor  may  be  chosen  to  be  35  percent 
solids,  40  minutes,  and  35°  C.  There  is  a good  chance  for  interactions  to 
exist  between  these  factors;  a higher  percent  solids  may  require  longer  leach- 
ing time,  a higher  temperature  may  reduce  the  time  necessary  to  obtain  optimum 
copper  extraction,  and  so  on.  A full  two-level  factorial  design  to  estimate 
the  three  main  effects  and  four  possible  interaction  effects  consists  of  eight 
tests.  The  results  of  these  eight  tests  will  give  estimates  of  the  relative 
magnitudes  of  the  various  effects.  If  an  estimate  of  experimental  error  were 
available,  the  results  would  show  which  of  the  effects  are  important  enough 
for  further  investigation. 

It  may  be  desirable  to  explore  a larger  range  of  values  for  each  of  the 
factors  in  this  experiment,  say  15  to  45  percent  solids,  10  to  60  minutes' 
leaching  time,  and  15°  to  90°  C.  The  set  of  eight  tests  in  the  two- level  fac- 
torial design  previously  described  can  be  the  first  step  in  an  exploration  of 
the  larger  region.  Evaluation  of  the  results  of  this  set  will  show  how  to 
vary  the  three  factors  simultaneously  to  find  the  optimum  leaching  conditions 
within  the  region.  The  results  may  show,  for  example,  that  starting  with  the 
central  value  of  each  factor  in  the  initial,  two- level,  factorial  design  the 
optimum  conditions  will  be  approached  most  rapidly  if  the  percent  solids  is 
increased  by  7 percent  and  time  is  increased  by  4.5  minutes  for  each  tempera- 
ture increase  of  10°  C.  The  procedure  is  outlined  as  follows:  A set  of  tests 
is  made,  evaluation  of  the  data  from  this  set  determines  another  set  of  tests 
to  be  made,  evaluation  of  the  data  from  the  second  set  determines  still 
another  set,  and  the  procedure  is  continued  until  the  optimum  conditions  are 
located  in  the  region.  It  is  possible  to  locate  the  optimum  conditions  with 
more  certainty  and  with  fewer  tests  in  this  way  than  by  the  one-factor-at-a- 
time  approach. 
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To  develop  a single  method  for  analyzing  the  results  of  factorial  experi- 
ments regardless  of  particular  levels  selected  for  the  various  factors,  it  is 
necessary  to  define  the  concepts  of  base  level  and  design  unit  for  each  factor 
in  an  experiment.  The  base  level  for  each  factor  in  a two-level  factorial 
design  is  the  average  level  of  that  factor.  Therefore,  the  base  levels  in  the 
initial  two-level  factorial  leaching  experiment  are  30  percent  solids,  35  min- 
utes' leaching  time,  and  30°  C.  leaching  temperature.  The  design  unit  for 
each  factor  in  a two- level  experiment  is  the  difference  between  the  higher  and 
lower  levels.  The  design  units  in  the  leaching  experiment  are  10  percent 
solids,  10  minutes,  and  10°  C.  The  use  of  these  concepts  corresponds  to  the 
translation  of  coordinate  axes  of  the  response  function  from  the  natural  ori- 
gin at  0 percent,  0 minutes,  and  0°  C.  to  the  base  levels  of  each  factor,  and 
to  the  change  in  scale  from  natural  units  to  design  units.  The  lower  level  of 
each  factor  is  simply  one-half  a design  unit  below  the  base  level,  and  the 
higher  level  is  one-half  a design  unit  above  the  base  level.  These  concepts 
form  the  basis  for  subsequent  discussions  of  factorial  designs  and  the  analy- 
sis of  results  obtained.  The  relative  locations  of  the  various  tests  in  the 
three-factor  two- level  factorial  design  are  shown  geometrically  in  figure  8. 

In  this  figure,  the  lower  levels, -1/2,  represent  25  percent  solids  for  Xa,  30 
minutes'  leaching  time  for  X^,  and  25°  C.  leaching  temperature  for  X c.  The 
higher  levels,  +1/2,  represent  35  percent,  40  minutes,  and  35°  C. , respec- 
tively, for  Xa,  Xfc,  and  Xc.  Each  design  point  is  identified  by  the  test  sym- 
bol, the  coordinates  of  the  point  in  design  units,  and  the  response  obtained 
from  the  test  at  the  point.  In  the  discussion  and  illustration  of  two-level 
factorial  designs,  the  subscripts  are  omitted  from  the  test  symbols. 

The  various  main  and  interaction  effects  of  the  three  factors  upon  the 
amount  of  copper  leached  are  estimated  by  means  of  appropriate  differences  in 
the  responses  obtained  under  different  test  conditions.  For  two-level  facto- 
rial designs,  each  main  and  interaction  effect  that  can  be  estimated  has  only 
one  polynomial  component.  Hence,  in  two-level  designs,  one  may  speak  of  the 
coefficient  of  an  effect,  instead  of  the  coefficient  of  a component  of  the 
effect.  For  example,  the  estimated  coefficient  A of  the  linear  main  effect  of 
percent  solids  is  equal  to  the  difference  between  the  average  response  for  the 
four  tests  at  35  percent  and  the  average  response  for  the  four  tests  at  25 
percent;  that  is. 


A (l/4)(yail  + y81  s + y22I  + y222) 

-(l/4)(yill  + ylis  + y121  + y122).  (141) 

If  there  were  no  interaction  effect  between  percent  solids  and  leaching  time, 
estimates  obtained  from  tests  made  at  the  higher  and  lower  levels  of  time 
should  be  equal.  When  such  an  interaction  exists,  the  corresponding  coeffi- 
cient AB  will  be  equal  to  one-half  the  difference  between  the  two  estimates; 
that  is, 

AB  - ( l/2)[( l/2)(yaai  + y232  “ yj 21  " y122 
(l/2)(yail  + y212  “ yiai  " yn8)] 

= <iM>[ya81  + yssa  +ym  + yll8) 

(?i8i  + yi  as  + ^ai  i + ya i s ^ ' 


(142) 
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FIGURE  8,  - Diagram  of  a Two-Level  Factorial  Design  for  Three  Factors. 

The  coefficient  ABC  of  the  three-factor  interaction  effect  of  percent 
solids,  time,  and  temperature  is  equal  to  one-half  the  difference  between  two 
estimates  of  the  coefficient  AB  obtained  from  tests  made  at  the  higher  and 
lower  levels  of  temperature;  that  is, 

ABC  - (l/2)[(l/2)(yasa  + yua  - yiaa  - 

(l/2)(yaal  + yjjj  - y1  S1  - y811  )] 

— 3^1  l a ^ si  Yoi  i ) 
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Similar  linear  combinations  of  the  responses  will  yield  calculations  of  the 
coefficients  B,  C,  AC,  and  BC.  The  calculations  of  coefficients  of  all  main 
and  interaction  effects  may  be  verified  by  using  the  methods  given  in  the  sec- 
tions on  orthogonal  transformation  of  response  functions. 

The  theory  for  two- level  factorial  designs  may  be  developed  further  by 
focusing  attention  upon  the  signs  associated  with  each  response  in  the  linear 
combinations  used  to  calculate  the  coefficients.  The  transformed  response 
functions  may  be  used  to  find  the  appropriate  linear  combinations.  However, 
there  is  a simpler  procedure  which  may  be  used  for  two-level  designs.  The 
procedure  is  algebraic  and  involves  substituting  the  test  symbols  shown  in 
figure  8 for  the  corresponsing  responses  in  the  calculations  for  the  coeffi- 
cients. If  the  fraction  (1/4)  is  omitted,  the  right-hand  members  of  equations 
(141),  (142),  and  (143)  become 

(a  + ac  + ab  + abc)  - [(1)  + c + b + be], 

[ab  + abc  + (1)  + c]  - (b  + be  + a + ac),  (144) 

(abc  + c + b + a)  - [be  + ac  + ab  + (1)]. 

The  quantities  in  the  array  (144)  may  be  factored  respectively  as  follows 

(a  - l)(b  + l)(c  + 1), 

(a  - l)(b  - l)(c  + 1),  (145) 

(a  - l)(b  - l)(c  - 1). 

Note  that  the  minus  signs  in  the  expressions  of  array  (145)  correspond  to  the 
factors  involved  in  the  coefficient  to  be  estimated;  that  is,  (a  - 1)  corre- 
sponds to  factor  Xa,  percent  solids;  (b  - 1)  corresponds  to  factor  Xb,  leach- 
ing time;  and  (c  - 1)  corresponds  to  factor  Xj.,  leaching  temperature.  Similar 
expressions  for  the  coefficients  B,  C,  AC,  and  BC,  respectively,  are 

(a  + l)(b  - l)(c  + 1), 

(a  + l)(b  + l)(c  - 1), 

(146) 

(a  - l)(b  + l)(c  - 1), 

(a  + l)(b  - l)(c  - 1). 

Therefore,  in  the  general  case  expressions  similar  to  those  in  arrays  (145) 
and  (146)  may  be  set  up  with  minus  signs  corresponding  to  the  factors  involved 
in  the  coefficient  to  be  estimated,  and  then  those  expressions  expanded  to 
obtain  the  proper  signs  for  the  tests  as  shown  in  the  expressions  of  array 
(144).  The  same  signs  then  are  used  in  the  linear  combinations  of  the  corre- 
sponding responses  which  are  used  to  calculate  the  coefficients. 


The  signs  to  be  associated  with  the  responses  for  the  various  tests  when 
calculating  the  coefficients  for  the  three-factor,  two- level,  factorial  copper 
leaching  experiment  are  summarized  in  table  5.  Tables  of  this  kind  are  useful 
in  designing  two- level  fractional  factorial  experiments,  and  for  this  reason 
the  column  for  the  average  effect  coefficient,  I,  is  included.  The  upper  left- 
hand  corner  of  the  table  can  be  used  for  a two- factor  experiment.  The  table 
can  be  extended  for  use  with  four  or  more  factors.8 

TABUE  5.  - Signs  to  be  associated  with  the  responses  for  the  tests  in 
a three- factor . two- level,  factorial  experiment  when 
calculating  the  coefficients  of  the  main  and  inter- 
action  effects  for  the  factors  Xa,  Xj,,  and  X<. 


Test  No. 

Coefficient 

I 

A 

B 

ab  ; 

c 

AC 

BC 

ABC 

(1) 

+ 

- 

- 

1 

+ 1 

1 

- 

+ 

+ 

- 

a 

+ 

+ 

- 

1 

1 

- 

- 

+ 

+ 

b 

+ 

- 

+ 

1 

1 

1 

- 

+ 

- 

+ 

ab 

+ 

+ 

+ 

1 

+ ! 

1 

- 

- 

- 

- 

c 

+ 

- 

- 

+ 

+ 

- 

- 

+ 

ac 

+ 

+ 

- 

- 

+ 

+ 

- 

- 

be 

+ 

- 

+ 

- 

+ 

- 

+ 

- 

abc 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

To  understand  how  the  table  is  extended  for  more  factors,  notice  the 
signs  in  columns  A,  B,  and  C.  Hie  minus  sign  in  column  A corresponds  to  a 
test  in  \rfiich  factor  Xa  is  at  its  lower  level,  and  the  plus  sign  corresponds 
to  a test  in  which  factor  Xa  is  at  its  higher  level.  The  same  is  true  for 
columns  B and  C and  the  factors  Xj,  and  Xc,  respectively.  For  each  test,  the 
signs  in  columns  A,  B,  and  C agree  with  the  signs  of  the  coordinates  of  the 
test  point  in  figure  8.  The  sign  in  column  AB  for  any  row  is  the  product  of 
the  signs  for  that  same  row  in  columns  A and  B.  The  signs  in  columns  AC,  BC, 
and  ABC  are  found  in  the  same  way. 

To  extend  the  table  for  a fourth  factor,  X^,  the  test  symbols  for  the 
three-factor  factorial  design  in  the  left  column  are  each  multiplied  by  the 
letter  d,  that  is,  d,  ad,  bd,  abd,  cd,  acd,  bed,  and  abed.  Eight  additional 
rows  are  added  to  the  table  corresponding  to  these  additional  test  symbols  in 

eDavies,  0.  L. , Design  and  Analysis  of  Industrial  Experiments: Hafner  Publish- 
ing  Co.,  New  York,  N.Y. , 1956,  table  M,  p.  624a. 
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the  order  given.  Similarly,  the  coefficient  symbols  at  the  head  of  the  remain- 
ing columns  are  each  multiplied  by  the  letter  D to  give  D,  AD,  BD,  ABD,  CD, 

ACD,  BCD,  and  ABCD.9  Eight  additional  columns  corresponding  to  these  addi- 
tional coefficients  are  added  to  the  table  in  the  order  given.  The  signs  in 
the  eight  new  rows  for  the  first  eight  columns  of  the  extended  table  will  be 
the  same,  respectively,  as  the  signs  in  the  eight  rows  of  table  5,  that  is, 
row  d will  be  the  same  as  row  (1),  row  ad  will  be  the  same  as  row  a;  and  so 
on.  Minus  signs  are  placed  in  the  first  eight  rows  of  column  D of  the 
extended  table,  and  plus  signs  are  placed  in  the  last  eight  rows  of  this  column. 
Hie  signs  in  the  remaining  seven  added  columns  are  obtained  as  products  of  the 
appropriate  corresponding  signs  in  columns  A,  B,  C,  and  D as  previously 
explained  for  table  5.  The  table  may  be  successively  extended  in  a similar 
way  for  any  number  of  factors. 

If  one  of  the  factors  in  a two-level  experiment  were  qualitative  instead 
of  quantitative,  then  either  level  of  that  factor  would  be  selected  as  the 
lower  level  and  considered  to  be  one-half  a design  unit  below  an  artificial 
base  level.  The  other  level  then  is  considered  to  be  the  higher  level,  one- 
half  a design  unit  above  the  artificial  base  level.  The  effect  of  the  quali- 
tative factor  has  one  artificial  linear  component  for  which  the  corresponding 
coefficient  may  be  calculated  from  the  results  of  a two-level  factorial 
experiment.  Similarly,  interaction  effects  involving  one  or  more  qualitative 
factors  may  be  considered  to  have  corresponding  artificial  polynomial 
components. 

Factorial  Designs  With  More  than  Two  Levels  per  Factor 

Factorial  designs  in  which  more  than  two  levels  are  selected  for  some  or 
all  of  the  factors  are  used  when  a more  complete  investigation  of  the  relation- 
ships between  factors  and  responses  is  desired.  Two- level  factorial  designs 
can  be  used  to  explore  a region  only  when  the  main  effects  of  the  factors  are 
linear  or  approximately  linear  over  the  region,  and  interaction  effects  are 
relatively  small  compared  to  the  main  effects.  However,  when  main  effects  are 
not  linear  over  the  region  of  interest  and  interaction  effects  are  relatively 
large,  more  than  two  levels  for  the  factors  are  necessary  for  a complete  study 
of  the  response  surface. 

Typical  experiments  wherein  the  factors  must  be  tested  at  more  than  two 
levels  are  illustrated  as  follows:  An  oxidized  nickel  ore  is  subjected  to  a 
reducing  roast  to  convert  the  metal  oxide  to  metal.  At  600°  and  700°  C. 
roasting  temperature,  80  and  82  percent  of  the  nickel,  respectively,  are  con- 
verted to  metallic  form.  Comparison  with  an  available  estimate  of  experimen- 
tal error  shows  this  difference  to  be  insignificant.  There  may  be  several 
explanations  for  these  results:  (1)  The  maximum  possible  reduction  may  have 
been  obtained,  (2)  a still  higher  temperature  may  be  necessary  to  break  down 
certain  compounds  in  the  ore  which  represent  18  to  20  percent  of  the  metal, 
or  (3)  side  reactions  at  700°  C.  may  cause  some  of  the  reduced  metal  to 
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recombine  with  other  materials  present.  In  the  absence  of  additional  informa- 
tion about  the  reactions  involved,  it  is  possible  that  a higher  percent  reduc- 
tion might  be  obtained  either  at  650°  or  at  800°  C.  If  such  possibilities  are 
suspected  in  advance,  the  original  experimental  design  should  include  at  least 
three  levels  for  temperature. 

Now,  suppose  a second  factor,  time,  also  is  to  be  studied  at  the  levels 
1 and  2 hours.  The  four  tests  in  a two- level  factorial  experiment  may  be  test 
(1):  600°  C. , 1 hour;  test  a:  800°  C. , 1 hour;  test  b:  600°  C. , 2 hours;  and 
test  ab:  800°  C. , 2 hours.  The  results  for  tests  (1),  a,  and  b may  be  89,  90, 
and  87  percent  reduction,  respectively.  Either  the  increase  in  temperature  or 
in  time  appears  to  yield  an  increase  in  percent  reduction.  However,  the 
result  for  test  ab  is  only  83  percent  reduction.  If  experimental  error  were 
less  than,  say,  1 percent,  such  a result  would  be  evidence  of  an  important 
interaction  effect  between  time  and  temperature.  The  design  should  be 
expanded  to  include  a third  level  for  each  factor,  say  700°  C.  and  1.5  hours, 
to  further  evaluate  the  effects. 

If  there  were  evidence  to  indicate  that  a response  has  a maximum  value  at 
some  interior  point  in  an  experimental  region  and  has  lower  values  around  the 
boundary  of  that  region,  it  would  be  necessary  in  locating  the  maximum  value 
to  include  at  least  three  levels  for  each  factor  in  a design  which  covers  the 
region.  The  success  in  locating  the  maximum  value  by  using  a specific  number 
of  levels  will  depend  upon  the  relative  importance  of  the  higher  order  compo- 
nents that  are  not  estimated  by  the  design.  In  the  procedure  for  determining 
optimum  conditions  for  a process,  the  response  values  obtained  may  increase  to 
a point  and  then  begin  to  decrease  again.  In  the  neighborhood  of  such  a 
point,  a factorial  design  with  at  least  three  levels  per  factor  must  be  used 
to  estimate  quadratic  components  of  the  main  effects  of  the  factors  and  pin- 
point the  optimum  conditions. 


The  base  levels  and  design  units  for  factorial  experiments  of  three 
levels  or  more  are  defined  in  a manner  similar  to  that  for  two-level  designs. 
The  base  level  for  a factor  is  always  the  average  value  of  the  factor  levels 
in  the  design  in  terms  of  the  original  factor  units.  The  design  unit  is  the 
difference  between  two  adjacent  levels.  Therefore,  in  a roasting  experiment 
to  study  the  effect  of  temperature  at  600°,  700°,  and  800°  C. , the  base  level 
is  700°  C.  When  the  effect  of  time  is  to  be  studied  at  1,  2,  3,  and  4 hours, 
the  base  level  is  2.5  hours.  The  corresponding  design  units  in  these  examples 
are  100°  C.  and  1 hour,  respectively.  The  levels  of  each  factor  are  equally 
spaced;  otherwise,  the  discussion  of  the  design  and  subsequent  analysis  of 
results  is  more  complicated.  In  a specific  research  problem,  it  might  be 
preferable  to  study  the  effect  of  temperature  by  using  equal  spacing  of  values 
of  the  function  e^  in  the  design,  instead  of  equal  spacing  of  T values,  and  to 
make  similar  transformations  of  the  units  of  other  factors.  The  base  levels 
and  design  units  then  are  defined  in  terms  of  the  transformed  variables. 

The  general  procedures  for  calculating  the  estimated  coefficients  of  the 
various  effect  components  for  factorial  experiments  are  given  in  the  sections 
on  orthogonal  transformation  of  response  functions.  Tables  of  signs  to  be 
associated  with  the  responses  for  the  various  tests  are  not  prepared  for 


designs  involving  three  or  more  levels  per  factor.  There  are  only  two  alge- 
braic signs,  and  the  convenience  of  associating  one  sign  with  each  level  of  a 
factor  is  lost. 

The  factors  in  the  previous  examples  are  quantitative.  It  also  is  possi- 
ble to  test  for  effects  of  qualitative  factors  in  the  same  way.  One  qualita- 
tive factor  which  often  is  studied  in  metallurgical  research  is  the  type  of 
acid  to  be  used  for  leaching,  and  two  levels  which  may  be  selected  are  hydro- 
chloric acid  and  sulfuric  acid.  In  this  case,  the  concepts  of  base  level  and 
design  unit  have  no  physical  meaning,  but  procedures  to  evaluate  the  main  and 
interaction  effects  involving  qualitative  factors  are  the  same  as  for  quantita- 
tive factors.  The  only  exception  is  that  if  three  or  more  levels  are  selected 
for  a qualitative  factor,  the  linear,  quadratic,  and  higher  order  components 
of  the  effects  have  no  physical  meaning.  Therefore,  only  the  overall  effects 
are  estimated. 


Fractional  Factorial  Designs  and  Confounding 

It  is  not  always  practical  or  necessary  to  make  all  the  tests  in  a full 
factorial  design.  As  the  number  of  factors  to  be  tested  in  an  experiment 
increases,  the  number  of  tests  in  a full  factorial  design  increases  rapidly. 

If  some  of  the  interaction  effects  are  insignificant,  it  may  be  possible  to 
reduce  the  number  of  tests  in  a design  by  confining  the  estimates  of  the 
various  effects  so  that  only  the  important  effects  need  be  studied  in  an 
experiment.  When  the  estimates  of  effects  thus  are  combined,  they  are  said 
to  be  confounded. 

Fractional  factorial  designs  and  the  principle  of  confounding  are  partic- 
ularly useful  vdien  it  is  necessary  to  introduce  minor  factors  into  an  experi- 
ment. Such  factors  are  of  no  interest  in  themselves,  but  it  is  important  to 
plan  the  experiment  so  that  the  effects  of  these  minor  factors  do  not  inter- 
fere with  the  estimates  of  the  effects  of  major  experiment  factors.  For 
example,  a two-level  factorial  experiment  to  study  the  effects  of  four  physico- 
chemical factors  requires  16  tests.  To  obtain  estimates  of  the  desired 
effects,  the  other  experiment  conditions  must  be  constant.  This  may  be  impos- 
sible, particularly  if  the  tests  of  the  experiment  must  be  made  in  two  or  more 
sets  on  different  days  or  by  different  men  so  as  to  complete  the  entire 
experiment  in  1 day.  In  designing  experiments  involving  uncertainty  about 
control  of  minor  factors,  the  number  of  tests  to  be  made  is  not  changed,  but 
the  manner  in  which  the  tests  are  divided  into  blocks  should  be  planned  so 
that  any  effect  due  to  making  the  tests  in  different  blocks  will  not  be  con- 
founded with  important  effects  of  the  major  factors  being  tested. 

A two-level  leaching  experiment  may  be  designed  to  study  the  effects  of 
time,  Xa,  at  1 and  2 hours;  temperature,  Xj,,  at  25s  and  50°  C. ; type  of  acid, 

X , using  hydrochloric  and  sulfuric  acids,  and  concentration  of  acid,  Xj,  at 
50  and  100  grams  per  liter  in  the  leach  solution.  There  are  16  tests  in  the 
full  replicate  of  this  design,  and  the  results  of  such  an  experiment  may  be 
used  to  calculate  the  coefficients  A,  B,  C,  D,  AB,  AC,  AD,  BC,  BD,  CD,  ABC, 

ABD,  ACD,  BCD,  and  ABCD.  Suppose  two  men  each  make  eight  of  the  tests,  all 
tests  at  25°  C.  are  made  by  one  man,  and  all  tests  at  50s  C.  are  made  by  the 
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other  man.  If  there  were  any  difference  in  the  performance  of  the  two  men, 
the  effects  of  the  difference  in  performance  upon  the  response,  in  this 
instance,  metal  extraction,  would  be  confounded  with  the  effect  of  temperature. 
The  only  valid  conclusion  which  can  be  drawn  from  the  results  obtained  is  that 
the  coefficient  B in  this  case  is  due  either  to  temperature  or  the  difference 
in  performance  of  the  two  men  or  both  factors.  Only  if  it  were  known  that  the 
performances  of  the  two  men  are  essentially  the  same  would  the  coefficient  B 
be  attributed  to  temperature.  However,  it  may  not  be  convenient  to  plan  and 
conduct  an  experiment  to  compare  the  performances  of  the  two  men  before  con- 
ducting the  main  leaching  experiment.  The  effect  of  a difference  in  perform- 
ance can  be  confounded  with  one  of  the  less  important  interaction  effects 
instead  of  with  the  main  effect  of  temperature  as  in  the  above  example. 

Method  of  Confounding  Effects  by  Table  of  Signs 

The  procedures  for  confounding  effects  in  fractional  factorial  designs 
are  based  on  the  principles  of  mathematical  group  theory.  Before  describing 
the  general  procedures  for  confounding  effects,  the  method  involving  use  of  a 
table  of  signs  will  be  illustrated.  This  method  can  be  used  for  two-level 
designs  because  a minus  sign  can  be  associated  with  the  lower  level  and  a plus 
sign  with  the  higher  level  of  each  factor.  The  method  based  upon  a table  of 
signs  cannot  be  used  if  any  factors  are  to  be  tested  at  more  than  two  levels, 
later,  the  procedure  for  confounding  will  be  presented  in  a more  general  and 
more  abstract  way  for  use  with  designs  in  which  the  factors  are  tested  at  any 
number  of  levels. 

The  method  of  confounding  by  the  use  of  signs  now  is  illustrated  for  a 
three- factor  experiment  with  a two- level  design  in  which  it  is  necessary  to 
estimate  the  main  effects  of  factors  Xa,  Xb,  and  Xc  and  in  which  all  interac- 
tions between  the  three  factors  are  nonexistent  or  negligible.  In  a typical 
instance  factor  Xc  may  be  the  necessary  result  of  investigating  factors  Xa  and 
Xb  in  two  blocks  of  tests  due  to  limitations  of  equipment,  time,  or  personnel. 
The  first  step  is  to  construct  a table  of  signs  to  be  associated  with  the 
responses  for  the  tests  when  calculating  the  coefficients  of  the  various 
effect  components.  Such  a table  is  illustrated  in  table  5.  The  second  step 
is  to  select  four  of  the  eight  tests  in  table  5 so  that  the  main  effect  of  the 
block  factor  Xc  is  confounded  with  the  interaction  effect  between  time,  Xa, 
and  temperature,  X^.  This  is  accomplished  by  selecting  those  tests  from  table 
5 for  which  the  sign  in  column  AB  is  the  same  as  the  sign  in  column  C.  These 
tests  and  their  signs  then  are  incorporated  in  a second  table  as  shown  in 
table  6.  The  first  two  tests  are  made  at  the  lower  level  of  factor  Xc,  and 
the  last  two  tests  are  made  at  the  higher  level;  that  is,  tests  a and  b are 
made  in  the  first  block,  and  tests  c and  abc  are  made  in  the  second  block. 


TABLE  6.  - Tests  selected  from  the  eight  tests  of  a three-factor,  two- level. 

factorial  experiment  in  such  a wav  that  the  interaction  coef- 
ficient AB  is  confounded  with  the  main  coefficient  C 


Test 

Coefficient 

symbol 

I 

A 

B 

AB 

C 

AC 

BC 

ABC 

a 

+ 

+ 

- 

- 

- 

- 

+ 

+ 

b 

+ 

- 

+ 

- 

- 

+ 

- 

+ 

c 

+ 

- 

- 

+ 

+ 

- 

- 

+ 

abc 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

Whenever  the  tests  of  a full  factorial  design  are  selected  so  as  to  con- 
found two  particular  effects,  all  effects  will  be  confounded  In  pairs.  Table 
6 Illustrates  the  confounding  of  the  effects  whose  coefficients  are  AB  and  C. 

A study  of  the  table  shows  that  the  effects  vftiose  coefficients  are  B and  AC 
also  are  confounded.  The  same  Is  true  of  the  effects  whose  coefficients  are 
A and  BC  and  of  those  whose  coefficients  are  I and  ABC.  The  nature  of  the 
minor  factors  which  cause  block  effects  Is  such  that  only  the  main  effects  are 
considered  to  be  Important.  The  metallurgist  must  be  careful  to  conduct  an 
experiment  so  as  to  minimize  such  effects,  and  the  principle  of  confounding 
may  be  used  as  a precaution  against  anticipated  or  unforeseen  variations  in 
test  conditions.  In  this  illustration  the  interaction  effects  between  blocks 
and  the  major  experiment  factors,  time  and  temperature,  and  the  corresponding 
coefficients  AC,  BC,  and  ABC  are  negligible.  However,  in  the  more  general 
case,  it  is  necessary  to  study  the  complete  system  of  confounding  which 
results  when  two  effects,  or  their  coefficients,  are  selected  to  be  confounded. 


To  show  further  how  the  effects  and  their  corresponding  coefficients  are 
confounded  by  selecting  the  four  tests  in  table  6,  let  particular  factor 
levels  for  factors  Xg,  Xj,,  and  Xc  be  identified  by  the  letters  i,  j,  and  k, 
respectively.  The  response  function  for  a three-factor,  two-level  experiment 
then  will  be 


yijk  " ^oootijk)  + AZioo(ijk)  + BZoio(ijk)  + CZooi(ijk)  + ABZno(ijk) 

+ ACZi  01  (i  jk)  + BCZon  (ijk)  + ABCZi  1 1 ( i jk) 

« I ± M ± %B  ± ± MB  ± MC  ± %BC  ± ( 1/8)ABC.  (147) 


If  each  test  in  table  6 were  made  once,  the  average  response  yijk  for  each 
test  would  be  the  single  response  y^jk  obtained.  The  terms  in  equation  (147) 
may  be  rearranged  for  the  four  tests  a,  b,  c,  and  abc,  respectively,  to  give 
the  following  algegraic  sums: 


yBll  E*  + 

y18i  - [i  + 

yi  IS  + 

yS3S  E1  + 


( 1/8)ABC]  + 
(1/8)ABC]  - 
( 1/8)ABC]  - 
( 1/8)ABC]  + 


(%A  + %BC) 
(%A  + %BC) 
(M  + *BC) 
(M  + %BC) 


- (1[B  + MC) 
+ (%B  + UC) 

- (%B  + MC) 
+ (%B  + UC) 


- (%c  + %AB), 

- (%c  + MB), 
+ (^c  + Mb), 
+ (%c  + Mb). 


(148) 


Now  the  four  responses  and  their  equalities  in  the  equation  set  (148)  may  be 
combined  algebraically  to  calculate 


c + mb  - « - y811  - ylal  + yn9  + yaaa). 


(149) 


Since  the  data  from  the  four  tests  cannot  be  used  to  calculate  C and  AB  sepa- 
rately, but  only  the  sum,  C + %AB,  the  two  coefficients  are  said  to  be  con- 
founded with  each  other.  The  three  other  sums  of  confounded  coefficients 
which  may  be  calculated  from  the  data  are 


I + ( 1/8)ABC,  A + %BC,  and  B + MC 


(150) 


The  half  replicate  of  a three-factor,  two-level,  factorial  design  that 
consists  of  the  block  of  four  tests  in  table  6 is  an  adequate  design  if  at 
least  one  of  the  coefficients  in  each  sum  is  known  to  be  negligible.  The  sec- 
ond coefficient  in  each  pair  in  the  set  of  sums  (150)  is  an  interaction  coef- 
ficient involving  factor  Xc.  As  no  interaction  effects  exist  between  differ- 
ent blocks  of  tests  and  the  experiment  factors  time  and  temperature,  the 
second  coefficient  in  each  pair  may  be  considered  to  be  equal  to  zero.  The 
corresponding  algebraic  combinations  of  the  responses,  therefore,  may  be  set 
equal  to  the  first  coefficient  in  each  pair.  If  the  coefficient  AB  were  neg- 
ligible for  the  range  of  levels  selected  for  each  factor,  the  coefficient  C 
might  be  calculated. 

The  other  block,  which  can  be  used  for  the  experiment  to  give  the  same 
system  of  confounding,  is  the  half  replicate  consisting  of  the  four  remaining 
tests  in  table  5,  that  is,  tests  (1),  ab,  ac,  and  be.  These  could  have  been 
found  first  by  selecting  those  tests  in  table  5 which  have  opposite  signs  in 
columns  AB  and  C.  In  this  case,  four  algebraic  combinations  of  the  responses 
may  be  obtained  which  are  equal  to 

C - ( 1/2)AB,  I - ( 1/8)ABC,  A - (1/2)BC,  and  B - (1/2)AC.  (151) 

When  these  assumptions  are  valid,  either  the  four  tests  in  table  6 or  this 
second  block  of  four  tests  may  be  made  to  obtain  calculations  of  the  desired 
coefficients. 

Coefficients  that  are  confounded  with  each  other  in  a fractional  facto- 
rial design  are  called  aliases.  Therefore,  in  either  of  these  designs  con- 
sisting of  four  tests,  the  coefficients  C and  AB  are  aliases,  the  coefficients 
I and  ABC  are  aliases,  and  so  on.  In  general,  a two- level  factorial  experi- 
ment to  investigate  n factors  contains  2n  tests,  and  the  responses  obtained 
can  be  used  to  calculate  the  average  effect  coefficient  I and  (2n  - 1)  coeffi- 
cients of  the  possible  components  of  the  factor  effects.  These  2n  coeffi- 
cients all  will  be  confounded  in  pairs  of  aliases  whenever  a half  replicate  of 
such  an  experiment  is  designed.  The  value  of  such  designs  depends  upon  the 
ability  to  select  tests  in  such  a way  that  no  two  important  coefficients  will 
be  confounded  with  each  other. 

More  complex  systems  of  confounding  can  be  used  to  reduce  the  number  of 
tests  necessary  for  an  investigation  if  interaction  effects  of  major  factors 
are  negligible.  For  example,  if  five  factors  are  to  be  tested  and  all  inter- 
action effects  are  known  to  be  negligible,  only  the  main  effect  of  each  factor 
needs  to  be  estimated.  The  full  two-level  factorial  design  for  five  factors 
contains  32  tests.  The  eight  tests  in  a quarter-replicate  design  may  be 
selected  in  such  a way  that  algebraic  combinations  of  responses  may  be  used  to 
calculate  the  following  sums  of  aliases: 
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I + (1/8)ABD  + ( 1/8)ACE  + (1/16)BCDE, 

A + ( 1/2)BD  + ( 1/2)CE  + ( 1/ 16) ABODE, 

B + ( 1/ 2)AD  + ( 1/ 8)ABCE  + (1/4)CDE, 

C + ( 1/8)ABCD  + ( 1/2)AE  + (1/4)BDE, 

D + ( 1/ 2)AB  + ( 1/8)ACDE  + (1/4)BCE,  <152> 

E + ( 1/2)AC  + (1/8)ABDE  + (1/4)BCD, 

DE  + BC  + (1/2)ABE  + (1/2)ACD, 

BE  + CD  + ( 1/2) ABC  + (1/2)ADE. 

The  six  coefficients  I,  A,  B,  C,  D,  and  E are  calculated  from  the  responses  of 
the  eight  tests  selected.  Also,  since  all  interaction  effects  are  negligible, 
the  algebraic  combinations  of  responses  that  give  the  last  two  sums  of  aliases 
can  be  used  for  an  estimate  of  experimental  error. 

Such  a design  also  may  be  used  when  it  is  necessary  to  divide  the  tests 
of  a factorial  design  into  four  blocks  of  tests.  For  example,  a two-level 
factorial  design  for  the  four  factors  Xa,  Xj,»  Xc,  and  X<j  requires  16  tests. 

Now,  suppose  the  required  tests  must  be  made  in  four  blocks  of  four  tests  each. 
The  effects  of  making  the  tests  in  four  different  blocks  may  be  treated  as  the 
effects  of  two  additional  factors  Xe  and  Xf,  each  at  two  levels.  The  16  tests 
then  become  a quarter  replicate  of  a six-factor,  two-level  experiment. 

The  only  important  effects  involving  blocks  will  correspond  to  the  main 
effect  coefficients  E and  F and  the  interaction  effect  coefficient  EF.  The 
experiment  may  be  designed  so  that  important  coefficients  of  the  original  four 
factors  will  not  be  confounded  with  E,  F,  or  EF.  This  may  be  accomplished  by 
first  constructing  a table  of  signs  for  a full  six-factor,  two- level,  facto- 
rial experiment  by  procedures  discussed  in  the  section  on  two-level  designs. 
From  this  table,  32  tests  may  be  selected  that  will  confound  coefficient  E 
with  an  unimportant  interaction  coefficient  for  the  experiment  factors  Xa,  X^, 
Xj,,  and  Xj.  From  the  table  of  32  tests  and  their  signs  so  obtained,  a selec- 
tion of  16  tests  can  be  made  so  as  to  confound  coefficient  F with  another 
unimportant  interaction  coefficient  for  the  four  experiment  factors. 

General  Methods  for  Confounding  Effects 

The  method  of  confounding  by  means  of  a table  of  signs  is  relatively  sim- 
ple, but  it  is  limited  to  two-level  designs.  Also,  the  method  becomes  rather 
cumbersome  as  the  number  of  factors  increases  and  as  the  confounding  becomes 
more  complex.  The  general  procedure  for  determining  the  tests  to  be  made  to 
confound  effects  in  a fractional  factorial  design  may  appear  highly  abstract 
to  the  reader  who  is  not  familiar  with  mathematical  group  theory.  A short 
discussion  of  group  theory,  therefore,  will  be  presented  before  describing  how 
the  general  procedure  is  employed.  Once  the  basic  principles  of  group  theory 
are  understood,  the  general  procedure  is  more  readily  applied  than  the  method 
based  upon  a table  of  signs. 

Group  Theory.  - A group  is  defined  mathematically  to  be  a set  of  elements 
together  with  an  operation  defined  in  such  a way  that  whenever  the  operation 
is  performed  with  two  elements  in  the  set,  the  result  is  another  element  in 
the  set.  In  addition,  there  must  be  an  identity  element  in  the  set  such  that 
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when  the  operation  is  performed  with  the  identity  element  and  any  second  ele- 
ment, the  result  leaves  the  second  element  unchanged.  Finally,  for  each  ele- 
ment in  the  set,  there  must  be  a second  element  such  that  when  the  operation 
is  performed  with  these  two  elements,  the  result  is  the  identity  element. 

These  two  elements  are  called  inverses  of  each  other.  For  example,  the  set  of 
all  positive  and  negative  whole  numbers  and  the  operation,  addition,  consti- 
tute a group.  The  numbers  5 and  19  are  integers  and  5 + 19  = 24,  also  an 
integer.  Similarly,  7 + (-9)  = -2.  The  identity  element  in  this  case  is  0, 
for  example  0+7=7.  The  inverse  of  an  integer,  say  5,  is  -5,  that  is, 

5 + (-5)  = 0.  There  are  an  infinite  number  of  elements  in  this  group.  Simi- 
larly, the  set  of  all  rational  numbers  and  the  operation,  multiplication,  also 
form  a group  with  an  infinite  number  of  elements.  The  number  zero  has  no 
inverse  under  multiplication,  but  if  all  other  elements  in  a set  have  inverses 
within  the  set,  the  set  together  with  the  operation,  multiplication,  is  a 
group. 

An  example  of  a group  with  a finite  number  of  elements  is  the  set  of  two 
elements,  +1  and  -1,  and  the  operation,  multiplication.  The  structure  of  this 
finite  group  is  illustrated  as  follows: 


# 

+1 

-1 

+1 

+1 

-1 

-1 

-1 

+1 

(153) 


The  identity  element,  +1,  for  the  group  is  listed  first  in  the  column  headings 
and  in  the  row  headings.  The  remaining  values  in  the  table  are  the  products 
of  the  corresponding  row  and  column  elements.  The  dot  in  the  upper  left-hand 
corner  indicates  the  operation,  multiplication.  Each  element  in  this  group  is 
its  own  inverse.  A group  having  two  elements  but  a different  structure  from 
that  shown  in  group  (153)  consists  of  the  integers  zero  and  1 and  the  opera- 
tion, multiplication.  Again  the  identity  element  for  multiplication  is  1,  and 
it  is  listed  first. 


1 0 


1 


1 


0 


0 


0 

0 


(154) 


Tfce  concept  of  integers  modulo  k is  a mathematical  concept  which  is  use- 
ful in  the  study  of  group  theory.  The  concept  may  be  introduced  by  consider- 
ing the  measure  of  time  in  cycles  of  12  hours.  Nine  hours  after  midnight  it 
is  9 o'clock.  However,  5 hours  later,  it  is  not  14  o'clock,  but  2 o'clock. 
Another  11  hours  later,  it  is  not  25  o'clock,  but  1 o'clock.  The  above  state' 
ments  may  be  expressed  symbolically  as  follows:  9+5=2, 9+5+  11  =1. 

If  midnight  and  noon  are  designated  by  zero  instead  of  12,  the  hours  of  the 
clock  represent  the  integers  modulo  12.  These  integers  and  the  operation, 
addition,  constitute  a finite  group.  If  any  two  or  more  of  the  integers  from 
zero  through  11  are  added  in  this  manner,  the  result  also  is  one  of  these 


TF,I; 


integers.  The  identity  member  of  this  group  is  zero,  and  the  additive  inverse 
of  any  other  member  is  the  difference  between  12  and  that  member.  For  example, 
5+7  =0;  in  other  words,  7 hours  after  5 o'clock,  it  is  either  noon  or  mid- 
night. In  general,  the  integers  modulo  k are  those  integers  in  the  set  0,  1, 

2,  ....  (k  -1). 


The  integers  modulo  2 and  the  operation,  addition,  constitute  a group 
with  the  structure  shown  as  follows: 


0 

1 


0 

1 


1 

0 


The  identity  element  in  this  case  is  zero,  and  each  element  is  its  own  inverse. 
The  structure  of  arrangement  (155)  is  the  same  as  that  shown  in  arrangement 
(153).  This  means  that  both  elements  in  each  group  are  found  in  the  same  rela- 
tive locations  in  each  arrangement.  It  is  the  structure  of  this  group, 
whether  expressed  as  in  arrangement  (153)  or  (155),  which  forms  the  basis  of 
the  method  of  confounding  by  means  of  a table  of  signs. 


The  structure  of  the  group  consisting  of  the  integers  modulo  4 and  the 
operation,  addition,  is  as  follows: 


0 

1 

2 


0 

1 

2 


1 

2 

3 


2 

3 

0 


The  identity  element  is  zero,  and  the  inverse  of  each  element  is  the  differ- 
ence between  4 and  that  element.  The  structure  of  this  group  is  illustrated 

by  the  relative  locations  of  each  element  in  the  arrangement  (156).  It  has 
been  shown  that  the  group  consisting  of  the  integers  modulo  2 and  addition  and 
having  the  structure  illustrated  in  the  arrangement  (155)  has  the  same  struc- 
ture as  the  group  consisting  of  the  two  numbers,  +1  and  -1,  and  multiplication 
\diich  has  the  structure  illustrated  in  the  arrangement  (153).  Similarly,  the 
group  consisting  of  the  integers  modulo  4 and  addition  will  be  shown  to  have 

the  same  structure  as  a group  defined  for  the  coefficients  I,  A,  As , and  A3 

for  a one-factor,  four- level  experiment.  However,  before  illustrating  this 
relationship,  it  is  necessary  to  define  a new  concept,  group  theory  multiplica- 
tion. The  operation  is  used  to  obtain  products  of  coefficients  and  thereby  to 
determine  vrfiich  coefficients  are  aliases  in  a fractional  factorial  design. 


In  this  case  the  products  of  two  or  more  coefficients  are  purely  symbolic 
and  do  not  refer  to  the  numerical  values  of  the  coefficients  involved.  When 
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two  different  coefficients  are  set  equal  to  each  other,  the  equal  sign  repre- 
sents the  expression  "is  confounded  with."  In  a four- level  experiment,  the 
coefficient  A4  is  confounded  with  coefficient  I.  Therefore,  the  symbolic 
statement.  A4  =1,  serves  to  define  the  properties  of  group  multiplication  for 
this  experiment.  With  this  definition,  the  product  of  any  coefficients  will 
be  one  of  the  four  coefficients  I,  A,  A2,  and  A3.  Coefficient  I serves  as  the 
identity  element  in  the  group.  Therefore,  A(A2 ) = A3,  A(A3 ) = I,  A2 (A3)  = AI 
= A,  and  so  on.  The  inverse  of  any  element  A1  is  A4-1,  where  i = 1,  2,  3;  for 
example,  A(A3 ) = I.  The  four  coefficients  and  the  group  multiplication  opera- 
tion constitute  a finite  group  with  the  following  structure: 


a3 

A3 

I 

A 

A2 


(157) 


Note  that  the  relative  locations  of  the  powers  i of  A in  the  group  (157)  are 
the  same  as  the  relative  locations  of  the  integers  modulo  4 in  the  group  (156). 
The  two  groups  have  the  same  structure. 


Group  Theory  in  Design.  - The  principles  of  group  theory  form  the  basis 
for  the  general  procedure  for  confounding  effects  in  fractional  factorial 
designs.  The  groups  of  primary  interest  are  those  which  consist  of  sets  of 
coefficients  which  are  calculated  from  the  results  of  full  factorial  experi- 
ments and  the  group  theory  multiplication  operation  extended  for  use  in 
experiments  having  more  than  one  factor. 

The  application  of  group  theory  principles  to  the  design  of  fractional 
factorial  experiments  is  presented  first  in  the  following  discussion  in  terms 
of  two- level  designs.  Two- level  factorial  experiments  are  designed  to  esti- 
mate linear  components  of  the  main  effects  of  the  factors  and  the  correspond- 
ing components  of  the  interaction  effects.  Such  designs  cannot  be  used  to 
estimate  quadratic  components  of  the  main  effects.  An  application  of  the 
method  of  confounding  based  upon  a table  of  signs  would  show  that  all  main 
effect  quadratic  component  coefficients  are  confounded  with  each  other  and 
with  the  average  effect  coefficient  I in  a two- level  design.  This  is 
expressed  in  the  symbolic  notation  of  group  theory  as  follows: 

A2  =B2  = C2  = D2  = . . . =1.  (158) 

Because  of  this  confounding,  the  symbolic  product  of  coefficients  obtained  by 
group  multiplication  results  in  a product  that  is  one  of  the  coefficients  for 
which  a two-level  experiment  is  designed.  For  example,  (AB)(AC)  = A2BC; 
however.  A2  is  confounded  with  I which  serves  as  the  identity  and  is  omitted 
from  any  product  unless  it  stands  alone.  Therefore,  (AB) (AC)  = BC. 
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A four-factor,  two- level  experiment  will  be  used  to  illustrate  the  use  of 
group  theory  for  the  design  of  a half  replicate  of  the  corresponding  full  fac- 
torial design.  The  16  coefficients  which  are  calculated  from  the  results  of 
the  full  replicate  design  are  I,  A,  B,  C,  D,  AB,  AC,  AD,  BC,  BD,  CD,  ABC,  ABD, 
ACD,  BCD,  and  ABCD.  These  constitute  the  set  of  elements  in  the  group  that 
are  to  be  confounded  in  pairs  in  a half-replicate  design.  The  identity  ele- 
ment in  this  group  is  the  coefficient  I.  For  two- level  designs,  each  coeffi- 
cient is  its  own  inverse.  Suppose  D were  to  be  confounded  with  ABC;  that  is, 

D = ABC  in  group  theory  notation.  The  first  step  toward  finding  the  other 
pairs  of  aliases  is  to  determine  vfoich  coefficient  is  confounded  with  I as  a 
consequence  of  the  decision  to  confound  D with  ABC.  If  both  sides  of  the 
expression  D = ABC  are  multiplied  by  the  same  element,  the  symbolic  equality 
is  preserved.  Therefore,  D8  = ABCD,  but  D2  is  confounded  with  I according  to 
the  defining  expression  (158),  and  so  I = ABCD.  After  determining  that  I is 
confounded  with  ABCD,  these  two  coefficients  may  be  multiplied  by  any  other 
coefficient  to  find  its  alias.  For  example,  AI  = A2BCD,  or  A = BCD.  The  16 
coefficients  are  found  to  be  confounded  in  pairs  of  aliases  as  follows: 


D = ABC 
I = ABCD 
A = BCD 
B = ACD 


C = ABD 
AB  = CD 
AC  = BD 
AD  = BC 


(159) 


Any  pair  of  coefficients  in  the  set  of  pairs  (159)  could  have  been  chosen 
initially,  and  the  results  would  have  been  the  same.  For  example,  if  the  ini- 
tial decision  had  been  to  confound  AB  with  CD;  that  is,  AB  = CD,  then  the 
first  step  would  be  to  find  the  alias  of  I by  multiplying  either  AB  or  CD  by 
itself,  for  example,  (AB)(AB)  = (AB)(CD).  The  product  (AB)(AB)  = A2B2  ; both 
A2  and  B2  are  confounded  with  I,  and  so  A2B2  = I2.  The  operation  of  the  iden- 
tity, I,  upon  itself  leaves  it  unchanged,  so  I2  = I,  and  I = ABCD.  Then  the 
other  pairs  of  aliases  would  be  found  just  as  before. 

As  a second  example,  an  experiment  may  be  designed  to  study  the  effects 
of  several  factors  upon  metal  reduction  during  roasting.  Two  quantitative 
factors  to  be  studied  are  residence  time,  Xa,  and  roasting  temperature,  Xb- 
The  reliability  of  the  results  obtained  depends  upon  the  accuracy  of  analyti- 
cal procedures  used  to  assay  the  calcines  produced,  so  a qualitative  factor, 

Xq,  is  included  in  the  design  to  study  two  different  types  of  chemical  analy- 
sis. A fourth  factor,  Xj,  is  included  to  study  the  effects  of  two  different 
types  of  ore  used  in  the  tests.  It  may  be  known  or  assumed  that  the  coeffi- 
cient AC  of  the  interaction  between  the  residence  time  in  the  furnace  and  the 
type  of  chemical  analysis  used  to  assay  the  calcine  is  negligible,  and  so  one 
may  wish  to  confound  D with  AC.  Then  D2  = I = ACD,  AI  = A = A2 CD  = CD,  and 

the  complete  system  of  confounding  is  as  follows: 


D 

= AC 

C = AD 

I 

= ACD 

AB  = BCD 

A 

= CD 

BC  = ABD 

B 

= ABCD 

BD  = ABC 
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Defining  Contrasts  and  the  Principal  Block.  - The  next  step  in  designing 
fractional  factorial  experiments  by  group  theory  methods  is  to  determine  which 
tests  must  be  made  to  obtain  the  desired  confounding  of  coefficients.  This  is 
automatic  in  the  method  of  confounding  by  selecting  tests  from  a table  of 
signs  for  two- level  factorial  designs.  However,  when  using  group  theory 
methods,  it  is  necessary  to  make  use  of  the  concepts  of  defining  contrasts 
and  the  principal  block. 

In  any  system  of  confounding,  the  set  of  defining  contrasts  is  the  set 
of  aliases  which  contains  the  coefficient  I.  This  set  is  so  named  because  it 
serves  to  define  the  complete  system  of  confounding,  and  it  is  used  also  to 
determine  the  tests  which  are  to  be  made.  The  complete  system  of  confounding 
is  defined  by  the  set  of  defining  contrasts,  because  this  set  of  aliases  and 
group  multiplication  are  used  to  find  all  other  sets  of  aliases.  The  tests  to 
be  made  are  selected  by  first  using  the  defining  contrasts  to  find  the  tests 
in  the  principal  block.  Then  the  principal  block  and  a group  multiplication 
operation  defined  for  test  symbols  are  used  to  find  the  tests  in  other  blocks. 
Finally,  one  of  the  blocks  of  tests  is  selected  for  an  experiment. 

The  principal  block  of  tests  is  the  fractional  replicate  design  vrtiich 
contains  the  test  (1);  that  is,  the  test  in  which  all  factors  are  at  their 
lowest  level.  The  group  theory  procedures  require  that  this  set  of  tests  be 
found  first;  then  the  other  possible  sets  of  tests  are  found  by  using  the 
principal  block  and  the  group  multiplication  operation.  The  tests  in  the 
principal  block  are  found  by  expressing  both  tests  and  defining  contrasts  in 
expanded  notations. 

For  example,  in  the  roasting  experiment  with  the  system  of  confounding 
given  by  the  pairs  of  aliases  in  array  (160),  ACD  is  the  coefficient  of  the 
interaction  effect  for  the  factors  residence  time,  type  of  chemical  analysis, 
and  type  of  ore.  The  test  ab  is  made  at  the  higher  levels  of  residence  time 
and  roasting  temperature  and  the  lower  levels  of  type  of  analysis  and  type  of 
ore.  The  coefficient  ACD  also  is  written  A1  BP  C1  D*1  , and  the  test  ab  also  is 
written  a1b1c0dQ.  Each  coefficient  can  be  written  in  the  form  ApB<lCrDt,  and 
each  test  can  be  written  in  the  form  ayb^xdy  where,  in  this  example,  p,  q,  r, 
t,  v,  w,  x,  and  y can  equal  zero  or  1.  In  the  expanded  notation,  the  test 
ayb^jjdy  is  compared  with  the  coefficient  ApB<lCrD‘:  by  obtaining  the  sum 
pv  + qw  + rx  + ty.  For  the  test  ab  and  the  coefficient  ACD,  the  corresponding 
sum  (1)(1)  + (0)(1)  + (1)(0)  + (1)(0)  ■ 1.  On  the  other  hand,  for  the  test  ac 
and  the  coefficient  ACD,  the  sum  isl+0+l+0=2.  The  rule  for  finding 
the  tests  in  the  principal  block  is  expressed  as  follows: 

A test  is  in  the  principal  block  if  the  comparison  between  that  test  and 
each  of  the  defining  contrasts  gives  a sum  equal  to  zero  or  a multiple  of  2; 
that  is,  if. 


pv  + qw  + rx  + ty  = 2n,  n = 0,  1,  2 (161) 

The  comparison  between  any  test  and  the  coefficient  I gives  the  sum 
0+0+0+0=0.  Therefore,  the  essential  comparisons  are  those  between 
each  of  the  tests  and  all  other  coefficients  in  the  set  of  defining  contrasts. 
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In  the  roasting  experiment,  the  defining  contrasts  are  I and  ACD,  and  so  each 
test  must  be  compared  with  coefficient  ACD.  In  this  case,  p=l,q=0,r=l, 
and  t = I.  These  comparisons  are  shown  as  follows: 


Test 

symbol 

pv 

qw 

mx 

ty 

Sum 

(1) 

0 

0 

0 

0 

0 

a 

1 

0 

0 

0 

1 

b 

0 

0 

0 

0 

0 

c 

0 

0 

1 

0 

1 

d 

0 

0 

0 

1 

1 

ab 

1 

0 

0 

0 

1 

ac 

1 

0 

1 

0 

2 

ad 

1 

0 

0 

1 

2 

be 

0 

0 

1 

0 

1 

bd 

0 

0 

0 

1 

1 

cd 

0 

0 

1 

1 

2 

abc 

1 

0 

1 

0 

2 

abd 

1 

0 

0 

1 

2 

acd 

1 

0 

1 

1 

3 

bed 

0 

0 

1 

1 

2 

abed 

1 

0 

1 

1 

3 

The  tests  in  the  principal  block  are  those  for  which  the  sum  is  zero  or 
2;  that  is, 

(1),  b,  ac,  ad,  cd,  abc,  abd,  and  bed.  (162) 

The  other  block  consists  of  the  remaining  tests;  that  is, 

a,  c,  d,  ab,  be,  bd,  acd,  and  abed.  (163) 

Either  of  the  two  blocks  of  tests  (162)  or  (163)  can  be  used  to  obtain  the 
system  of  confounding  given  by  the  pairs  of  aliases  in  array  (160). 

In  a two- level  experiment  with  any  number  of  factors  Xa,  Xj,,  Xc,  . . . , 
X^,  the  tests  in  the  principal  block  are  found  by  calculating  sums  of  products 
of  corresponding  superscripts  and  subscripts  similar  to  the  sum  on  the  left 
side  of  equation  (161).  The  test  ayb^jj...^  is  in  the  principal  block  if  for 
each  coefficient  A^B^Cr...Ku  in  the  set  of  defining  contrasts,  the  sum 

pv  + qw  + rx  + ...  + uz  = 2n,  n = 0,  1,  2,  ...  (164) 

where  p,  q,  r,  ...  , u,  v,  w,  x,  ...  , z = 0 or  1.  If  a larger  number  of  fac- 
tors and,  therefore,  a larger  number  of  tests  are  involved  in  an  experiment, 
it  is  tedious  to  compare  all  tests  with  the  coefficients  in  the  set  of  defin- 
ing contrasts.  After  some  tests  in  the  principal  block  have  been  found,  the 
others  can  be  found  by  a method  now  to  be  given. 
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The  tests  in  the  principal  block  together  with  the  proper  operation  form 
a group.  The  operation  is  group  multiplication  defined  for  two- level  facto- 
rial designs  so  that 


a2  = b2  = c2  = d2  = . . . - 1.  (165) 

A test  symbol  represents  a particular  combination  of  factor  levels  in  an 
experiment  and  is  not  a number  in  the  elementary  algebraic  sense.  Therefore, 
the  product  of  test  symbols  is  purely  a symbolic  device  used  for  finding  the 
tests  in  a fractional  factorial  design.  Expression  (165)  states  that  whenever 
a letter  is  squared  in  the  product  of  tests  in  a two- level  factorial  design, 
the  square  is  either  omitted  from  the  product  or  replaced  by  1.  With  this 
definition,  the  product  of  any  two  tests  in  this  group  will  be  another  test  in 
the  same  group.  For  example,  the  tests  ac  and  ad  are  in  the  principal  block 
(162);  the  product  (ac)(ad)  ■ a3cd  ■ cd  which  also  is  in  the  principal  block. 
To  make  use  of  this  property  in  a systematic  way,  the  tests  may  be  compared  as 
follows:  Compare  the  tests  represented  by  one  letter  in  alphabetical  order 
with  the  defining  contrasts,  then  the  tests  represented  by  two  letters  also  in 
alphabetical  order,  and  so  on  until  three  tests,  other  than  test  (1),  have 
been  found.  These  three  tests  should  be  independent  in  the  sense  that  the 
product  of  any  two  or  all  three  of  them  is  not  one  of  these  three  tests.  The 
first  three  tests  in  this  example  are  b,  ac,  and  ad.  These  may  be  multiplied 
together  in  pairs  to  find  three  additional  tests;  that  is,  (b)(ac)  = abc, 
(b)(ad)  ■ abd,  and  (ac)(ad)  * cd.  All  three  may  be  multiplied  together  to 
find  another  test,  (b)(ac)(ad)  = bed.  There  must  be  eight  tests  in  the  half- 
replicate  design,  so  as  soon  as  eight  tests  have  been  found,  the  design  is 
complete. 

These  eight  tests,  (1),  b,  ac,  ad,  abc,  abd,  cd,  and  bed,  are  the  same  as 
those  previously  found  in  the  principal  block  (162).  If  there  are  16  tests  in 
the  principal  block  of  a fractional  factorial  design,  it  is  necessary  to  find 
four  independent  tests  in  order  to  obtain  enough  products  to  find  the  remain- 
ing tests.  If  there  are  32  tests,  5 independent  tests  must  be  found.  In 
general,  if  there  are  2n  tests  in  the  design,  n independent  tests  must  be 
found. 

A relationship  exists  between  the  tests  in  the  principal  block  and  those 
in  the  other  blocks  for  fractional  factorial  designs  that  is  used  when  there 
are  more  than  two  blocks  from  which  to  choose.  For  example,  in  the  case  of  a 
quarter -replicate  design,  there  are  four  possible  blocks,  and  the  relationship 
is  used  to  find  the  three  nonprincipal  blocks.  This  relationship  uses  the 
group  multiplication  operation  defined  by  expression  (165)  for  the  test  sym- 
bols in  two-level  factorial  designs  and  can  be  explained  by  referring  to  the 
principal  block  (162).  First,  any  test  which  is  not  in  the  principal  block 
must  be  selected.  One  such  test  is  the  test  a.  Then  each  test  in  the  prin- 
cipal block  is  multiplied  by  a to  find  the  other  tests  in  the  second  block. 
Thus  a(l)  = a,  a(b)  = ab,  a(ac)  = c,  and  so  on.  When  the  tests  in  block  (162) 
are  successively  multiplied  by  a,  the  tests  in  block  (163)  are  obtained  in  the 
following  order:  a,  ab,  c,  d,  acd,  be,  bd,  and  abed.  Any  of  the  other  tests 
in  block  (163)  could  have  been  used  with  the  tests  in  the  principal  block  to 
find  the  remaining  tests  in  block  (163).  In  each  case,  the  tests  would  be 
found  in  a different  order. 
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If  there  are  three  nonprincipal  blocks,  as  in  the  case  of  a quarter- 
replicate  design,  one  test  that  is  not  in  the  principal  block  is  selected,  and 
with  this  test,  the  remaining  tests  in  one  nonprincipal  block  are  found.  Then 
another  test  that  is  in  neither  the  principal  block  nor  the  first  nonprincipal 
block  is  selected,  and  the  tests  in  a second  nonprincipal  block  are  found. 

The  tests  which  have  not  been  found  in  the  principal  or  first  two  nonprincipal 
blocks  are  in  the  third  nonprincipal  block. 

It  may  be  necessary  to  know  not  only  that  two  coefficients  are  aliases, 
but  also  to  know  exactly  how  they  are  confounded  in  a particular  design.  For 
example,  it  is  sometimes  necessary  to  use  a fractional  factorial  design  even 
though  two  important  effects  are  confounded  with  each  other.  In  this  case,  it 
may  be  possible  to  use  two  or  more  different  fractional  factorial  designs  in 
order  to  obtain  separate  estimates  of  the  two  effects.  This  is  discussed 
further  in  the  next  section  on  partial  confounding.  However,  the  following 
observation  about  the  various  blocks  in  two- level  designs  will  be  helpful  in 
designing  such  experiments.  If  the  response  function  were  written  out  for 
each  test  in  the  principal  block  (162),  it  would  be  found  that  the  data 
obtained  could  be  used  to  calculate  the  sum  of  confounded  coefficients 
I - (1/8)ACD.  The  procedures  for  this  calculation  are  indicated  in  the  set 
of  equations  (148)  in  the  first  section  under  fractional  factorial  designs  and 
confounding.  The  data  from  the  tests  in  the  second  block  (163)  can  be  used  to 
calculate  the  sum  I + (1/8)  ACD.  If  a four-factor  experiment  is  designed  in 
which  I is  confounded  with  ABCD,  the  data  from  the  corresponding  principal 
block  will  yield  I + (1/16)ABCD.  Each  coefficient  in  a sum  of  defining  con- 
trasts is  multiplied  by  ( 1/2 )n , where  n is  the  number  of  letters  in  the  coef- 
ficient representing  factors.  The  sign  of  each  coefficient  in  the  sum  of 
defining  contrasts  for  a principal  block  may  be  determined  as  follows:  A 
coefficient  is  even  if  it  contains  letters  corresponding  to  an  even  number  of 
factors,  and  odd  if  it  contains  letters  corresponding  to  an  odd  number  of 
factors.  In  the  sum  of  defining  contrasts  for  the  principal  block,  the  even 
coefficients  are  positive  and  odd  coefficients  are  negative.  Coefficient  I 
contains  no  letters  corresponding  to  factors  and  is  always  even.  Coefficient 
ACD  is  odd.  Therefore,  data  from  the  principal  block  (162)  will  yield 
I - (1/8)ACD. 

The  signs  of  the  coefficients  in  the  sums  of  defining  contrasts  corres- 
ponding to  the  nonprincipal  blocks  are  determined  from  the  signs  in  the  sum 
of  defining  contrasts  for  the  principal  block  as  follows:  A test  is  selected 
in  the  block  for  which  the  signs  are  to  be  determined.  In  the  roasting 
experiment,  the  test  a is  in  the  nonprincipal  block  (163).  The  coefficients 
in  the  sum  corresponding  to  the  principal  block  which  contain  the  letter  A are 
each  multiplied  by  (-1).  The  letter  A in  the  coefficient  symbol  corresponds 
to  the  letter  a in  the  test  symbol.  In  this  example,  the  result  is 
I + (1/8)ACD,  and  this  is  the  sum  corresponding  to  block  (163).  Any  other 
test  in  this  block  could  have  been  used.  Consider  the  test  abed.  Each  coef- 
ficient containing  the  letters  A,  B,  C,  or  D is  multiplied  once  by  (-1)  for 
each  of  these  letters  which  it  contains.  If  the  coefficient  contains  two  of 
these  letters,  it  is  multiplied  by  (-1)8  ; if  it  contains  three  of  these  let- 
ters, it  is  multiplied  by  ( - 1 )3 , and  so  on.  The  coefficient  ACD,  therefore, 
is  multiplied  by  (-1)3  and,  again,  we  obtain  1 + ( 1/8)ACD.  As  another  example, 


consider  a quarter-replicate  design  with  the  following  set  of  defining  con- 
trasts: I,  ACE,  BDE,  and  ABCO.  The  principal  block  for  this  design  will 
yield  the  sum 

I - ( 1/8)ACE  - ( 1/ 8)BDE  + (1/16)ABCD.  (166) 

One  of  the  nonprincipal  blocks  contains  the  test  be.  Therefore,  the  sum  cor- 
responding to  this  block  is 

I - (-1)(1/8)ACE  - (-1)2 ( 1/8)BDE  + (-1)( 1/16)ABCD 

= I + ( 1/8)ACE  - ( 1/8)BDE  - (1/16)ABCD.  (167) 

It  remains  to  determine  how  the  other  sets  of  aliases  are  confounded  in 
any  block.  If  the  defining  contrasts  are  confounded  as  shown  in  the  sum  (166), 
then  the  sum  for  the  coefficient  A and  its  aliases  is  found  in  the  following 
way:  Each  coefficient  in  the  sum  (166)  together  with  its  sign  is  multiplied 
by  A to  obtain  A,  -CE,  -ABDE,  and  BCD.  Then  each  coefficient  so  obtained  is 
multiplied  by  (l/2)n_1 , where  n is  the  number  of  letters  in  the  coefficient. 
Thus  the  sum  so  obtained  is 

A - ( 1/2)CE  - ( 1/ 8)ABDE  + (l/4)BCD.  (168) 

In  this  example,  similar  sums  can  be  obtained  for  the  coefficients  B,  C,  D, 

E,  and  their  respective  aliases.  The  sum  for  the  coefficient  AB  and  its 
aliases  also  is  found  by  multiplying  each  coefficient  in  the  sum  (166) 
together  with  its  sign  by  AB.  In  this  case,  each  coefficient  then  is  multi- 
plied by  (l/2)n-2,  since  there  are  two  letters  in  the  coefficient  AB,  and  the 
sum  obtained  is 


AB  - ( 1/2)BCE  - ( 1/2)ADE  + CD.  (169) 

A similar  sum  can  be  obtained  for  the  coefficient  AD  and  its  aliases.  This 
procedure  can  be  applied  to  any  system  of  confounding. 

Higher  order  effect  coefficients  also  are  confounded  in  a two-level  fac- 
torial design.  Ordinarily,  the  range  of  values  for  each  factor  will  be 
selected  so  that  such  coefficients  are  not  expected  to  be  important.  However, 
one  should  be  aware  of  the  manner  in  which  higher  order  coefficients  can 
influence  the  estimates  of  lower  order  coefficients.  The  definition  of  group 
multiplication  for  two- level  designs  states  that  the  quadratic  components  of 
all  main  effects  are  confounded  with  the  average  effect  coefficient  I;  that  is, 
A2  = B2  = ...  =1.  Cubic  components  of  the  main  effects  will  be  confounded 
with  the  corresponding  linear  components,  for  example,  A3  = AA2  = AI  = A.  If 
higher  order  components  of  the  interaction  effect  are  important,  their  coeffi- 
cients also  will  be  confounded  with  lower  order  coefficients,  namely,  AB2  = A, 
A2B  = B,  A2B2  = I,  and  AaB  = AB3  ■ A3B3  = AB.  In  general,  A^B^  will  be  con- 
founded with  I if  both  p and  q are  even,  with  A if  p is  odd  and  q is  even, 
with  B if  p Is  even  and  q is  odd,  and  with  AB  if  both  p and  q are  odd. 


Partial  Confounding.  - Partial  confounding  is  used  in  an  experiment  when 
it  is  necessary  to  make  the  tests  in  different  blocks  and  yet  all  interactions 


among  the  major  factors  are  important.  Two  or  more  fractional  factorial 
designs  are  included  in  such  an  experiment  so  that  the  block  effects  can  be 
confounded  with  different  interaction  effects  in  each  design.  The  interaction 
effects  which  are  confounded  with  block  effects  in  any  particular  fractional 
factorial  design  cannot  be  estimated  by  the  tests  in  that  design.  Therefore, 
it  is  necessary  to  include  additional  fractional  factorial  designs  so  that 
each  of  the  major  factor  interaction  effects  can  be  estimated  by  the  tests  in 
at  least  one  of  the  designs. 

Let  a two- level  factorial  experiment  be  designed  to  study  the  effects 
upon  metal  extraction  of  time,  Xa,  temperature,  Xjj,  and  base  concentration, 

X,.,  during  leaching  of  a calcine  previously  prepared  under  reducing  conditions 
The  data  from  this  experiment  will  be  used  to  calculate  the  coefficients  I, 

A,  6,  C,  AB,  AC,  BC,  and  ABC,  all  of  which  are  expected  to  be  important.  The 
size  of  equipment  available  necessitates  making  the  eight  tests  in  two  blocks 
of  four  tests.  The  reduced  calcine  must  not  be  exposed  to  the  air,  and  it  is, 
therefore,  difficult  to  blend  the  calcines  from  several  different  roasts 
before  leaching.  It  is  possible  to  make  only  enough  calcine  for  four  leaches 
during  a single  roast.  Minor  variations  in  furnace  conditions  from  one  roast 
to  another  may  affect  the  results  obtained.  As  before,  the  effect  of  using 
calcines  from  different  roasts  is  regarded  as  the  effect  of  factor  X^,  and 
interactions  between  factor  Xj  and  the  three  principal  factors,  time,  tempera- 
ture, and  concentration,  are  unimportant. 

Two  experiments  can  be  designed  so  that  calculations  of  all  important 
coefficients  can  be  obtained.  One  experiment  can  be  designed  in  which  coeffi- 
cient D is  confounded  with  coefficient  ABC.  If  the  principal  block  is  used, 
the  results  of  this  experiment  can  be  used  to  calculate  I,  A,  B,  C,  AB,  AC, 

BC,  and  the  sum  D + ( 1/4) ABC.  Since  both  D and  ABC  are  important,  a second 
experiment  must  be  designed  in  which  D is  confounded  with  another  interaction 
coefficient,  say  AB.  In  this  case,  the  nonprincipal  block  can  be  used,  and 
the  results  of  this  second  experiment  can  be  used  to  calculate  I,  A,  B,  C,  AC, 
BC,  ABC,  and  the  sum  D + (1/2)AB.  Now,  the  independent  calculations  of  ABC 
from  the  data  of  the  second  experiment  and  AB  from  the  data  of  the  first 
experiment  may  be  used  to  calculate  D from  the  sums  D + (1/4)ABC  and 
D + (1/2)AB.  By  making  the  two  experiments,  the  coefficients  I,  A,  B,  C,  AC, 
and  BC  will  be  obtained  with  greater  precision  than  AB  and  ABC,  but  all  eight 
coefficients  may  be  obtained  in  this  way.  If  it  should  be  desired  to  obtain 
all  interaction  coefficients  with  the  same  degree  of  precision,  four  experi- 
ments may  be  designed  in  which  D is  confounded  with  ABC  in  the  first  experi- 
ment, with  AB  in  the  second,  with  AC  in  the  third,  and  with  BC  in  the  fourth. 
This  method  of  designing  two  or  more  experiments  to  obtain  calculations  of 
important  coefficients  is  called  partial  confounding. 

Higher  Degrees  of  Confounding  for  Two-Level  Designs 

Designs  involving  higher  degrees  of  confounding  are  useful  when  testing 
several  factors  at  two  levels  each  and  some  or  all  of  these  factors  do  not 
interact,  when  it  is  necessary  to  introduce  several  minor  factors  into  an 
experiment,  and  when  tests  are  planned  to  select  from  several  possible  factors 
those  which  are  more  important  for  detailed  investigation.  The  possibility  of 


using  a relatively  small  fractional  replicate  design  involving  a higher  degree 
of  confounding  deserves  greater  attention  as  the  number  of  factors  to  be 
studied  increases. 

The  procedure  for  designing  fractional  factorial  experiments  with  higher 
degrees  of  confounding  is  an  extension  of  the  procedure  already  presented  for 
half-replicate  designs  in  the  preceding  sections.  This  procedure  may  be  illus- 
trated with  the  following  example  that  involves  the  introduction  of  minor  fac- 
tors into  an  experiment.  A vanadium  ore  is  to  be  given  a chloridizing  roast 
to  increase  the  vanadium  extraction.  Four  factors  are  to  be  studied  as  fol- 
lows: Temperature,  Xa,  at  750°  and  800°  C. ; roasting  time,  X5,  at  1 and  2 
hours;  salt  addition,  Xc,  at  6 and  9 percent  of  the  charge;  and  particle  size, 
X<j»  at  1/4-inch  and  10-mesh  (0.065-inch).  The  full  two-level  factorial 
design  for  four  factors  requires  16  tests.  There  is  room  for  four  charges  in 
the  roasting  furnace,  and  there  is  a possibility  that  eddy  currents  and  other 
furnace  characteristics  may  cause  variations  in  roasting  conditions  in  differ- 
ent locations  in  the  furnace.  The  principles  of  confounding  are  used  to 
select  the  location  for  each  test  charge  so  that  effects  due  to  differences 
in  location  will  not  be  confounded  with  the  more  important  major  factor 
effects. 

The  four  furnace  locations  can  be  regarded  as  four  levels  of  a qualita- 
tive location  factor,  Xe.  The  effect  of  this  factor  can  be  analyzed  for 
linear,  quadratic,  and  cubic  components  with  the  coefficients  E,  E2 , and  E3 , 
respectively.  However,  since  the  present  discussion  is  focused  upon  two-level 
designs,  the  coefficient  E3  can  be  replaced  with  F and  E3  with  EF.  The  result 
will  be  a design  based  on  the  arbitrary  assumption  that  there  are  two  location 
factors,  Xe  and  Xf.  The  interaction  effect  between  Xe  and  Xf  corresponds  to 
the  cubic  component  of  the  effect  of  a single  location  factor  at  four  levels. 
The  resulting  design  and  subsequent  evaluation  of  the  effects  of  the  major 
factors  will  be  the  same  whether  one  location  factor  at  four  levels  or  two 
location  factors  each  at  two  levels  are  assumed.  The  assumption  of  two  fac- 
tors is  made  in  other  literature10  and  permits  the  experiment  to  be  designed 
as  a two- level  experiment.  All  interaction  effects  involving  both  major 
experiment  factors  and  location  factors  are  considered  to  be  negligible. 

The  system  of  confounding  that  prevents  location  effects  from  being  con- 
founded with  the  more  important  major  factor  effects  is  obtained  by  designing 
a quarter  replicate  of  a full  six-factor,  two-level  factorial  design.  Each  of 
the  location  effect  coefficients  E,  F,  and  EF  is  to  be  confounded  with  one  of 
the  major  effect  coefficients.  In  general,  the  coefficients  of  the  higher 
order  interaction  effects  of  the  major  factors,  that  is,  those  interaction 
effects  involving  a larger  number  of  factors,  are  confounded  with  the  location 
effect  coefficients.  Some  caution  is  necessary  in  selecting  the  coefficients 
to  be  confounded;  for  example,  if  E is  confounded  with  ABCD  and  F with  ABC, 
then  EF  will  be  confounded  with  the  product  (ABCD)(ABC).  The  symbolic  group 
theory  notation  for  two- level  designs  given  in  expression  (158)  states  that 
As  =B^  = C?  =>  E3  “I®  ■!.  Therefore,  any  letter  which  is  squared  in  a 


10Davies,  0.  L. , Design  ai  d Analysis  of  Industrial  Experiments:  Hafner  Pub- 
lishing Co.,  New  York,  N./. , 2d  ed. , 1956,  pp.  387-389. 


product  of  coefficients  is  omitted,  and  (ABCD)(ABC)  = D.  Thus,  EF  is  con- 
founded with  the  main  effect  coefficient  D.  A more  desirable  selection  is  to 
let  E be  confounded  with  ABC  and  F with  ABD.  Then  EF  will  be  confounded  with 
the  product  (ABC)(ABD)  = CD.  This  latter  selection  is  expressed  in  group 
theory  notation  as  follows: 

E - ABC,  F = ABD,  and  EF  = CD.  (170) 

The  set  of  defining  contrasts  for  the  complete  system  of  confounding  is  deter 
mined  by  the  three  pairs  of  aliases  (170).  The  symbolic  equality  E = ABC  is 
preserved  if  both  sides  are  multiplied  by  E.  Therefore,  E2  = ABCE,  but  E2  is 
confounded  with  I,  and  so  I = ABCE.  Similarly,  F2  = ABDF  and  E2E®  = CDEF  and 
the  set  of  defining  contrasts  is 

I,  ABCE,  ABDF,  CDEF.  (171) 

Now  to  find  the  aliases  of  coefficient  A,  each  coefficient  in  the  set  of 
defining  contrasts  is  multiplied  by  A to  obtain  AI  = A,  A(ABCE)  = BCE, 

A(ABDF)  = BDF,  and  A(CDEF)  = ACDEF.  The  main  and  interaction  effects  of  the 
four  factors,  Xfl,  Xb,  Xc,  and  Xd,  are  of  major  importance,  and  the  aliases  of 
each  corresponding  coefficient  are  as  follows: 


A, 

BCE, 

BDF, 

ACDEF; 

B, 

ACE, 

ADF, 

BCDEF ; 

c. 

ABE, 

ABCDF , 

DEF; 

D, 

ABCDE, 

ABF, 

CEF; 

AB, 

CE, 

DF, 

ABCDEF ; 

AC, 

BE, 

BCDF, 

ADEF; 

AD, 

BCDE, 

BF, 

ACEF; 

BC, 

AE, 

ACDF, 

BDEF; 

BD, 

ACDE, 

AF, 

BCEF; 

CD, 

ABDE, 

ABCF, 

EF; 

ABC, 

E, 

CDF, 

ABDEF ; 

ABD, 

CDE, 

F, 

ABCEF ; 

ACD, 

BDE, 

BCF, 

AEF ; 

BCD, 

ADE, 

ACF, 

BEF; 

ABCD, 

DE, 

CF, 

ABEF. 

Each  of  the  major  factor  effect  coefficients  is  confounded  with  three 
negligible  coefficients  of  interaction  effects  involving  both  major  factors 
and  location  factors  with  the  exception  of  the  three  coefficients  ABC,  ABD, 
and  CD.  Therefore,  a design  with  this  system  of  confounding  can  be  used  to 
estimate  the  main  effects  and  most  of  the  interaction  effects  of  the  major 
factors  without  interference  due  to  effects  of  different  locations  in  the 
furnace. 

The  next  step  in  designing  a quarter-replicate  design  with  the  system  of 
confounding  given  by  the  sets  of  aliases  (171)  and  (172)  is  to  determine  the 
16  tests  which  must  be  made.  As  explained  in  the  section  on  defining  con- 
trasts and  the  principal  block,  each  test  in  the  full  six-factor,  two-level 


factorial  design  must  be  compared  in  a systematic  way  with  each  coefficient  in 
the  set  of  defining  contrasts  (171)  in  order  to  find  the  tests  in  the  princi- 
pal block.  In  this  case,  each  test  must  be  compared  with  three  coefficients, 
other  than  I,  instead  of  one  as  in  the  case  of  half-replicate  designs.  The 
test  aubvCwdxeyfz  is  compared  with  the  coefficient  AnBPC<lDrEsFt  by  obtaining 
the  sum  nu  + pv  + qw  + rx  + sy  + tz.  If  a sum  equal  to  zero  or  a multiple  of 
2 is  obtained  for  a particular  test  and  each  of  the  defining  contrasts,  that 
test  is  in  the  principal  block.  There  are  16  = 24  tests  to  be  found,  and  so 
four  independent  tests  must  be  found  by  direct  comparison  with  the  defining 
contrasts  before  the  remaining  tests  can  be  found  by  group  multiplication  of 
the  test  symbols.  The  first  four  independent  tests  in  the  principal  block  are 

ab,  ce,  df,  acd.  (173) 

These  are  multiplied  together  in  pairs  to  obtain  six  additional  tests: 

abce,  abdf,  bed,  edef,  adc,  acf.  (174) 

The  first  four  tests  (173)  are  multiplied  together  in  sets  of  three  to  obtain 
four  more  tests : 


abedef,  bde,  bef,  aef.  (175) 

Finally,  all  four  tests  (173)  are  multiplied  together  to  give 

bef.  (176) 

These  15  tests  together  with  test  (1)  make  up  the  principal  block  of  the 
quarter-replicate  design.  The  other  three  design  blocks  are  found  by  the 
procedure  discussed  in  the  section  on  defining  contrasts  and  the  principal 
block.  The  letters  e and  f in  the  test  symbols  serve  to  identify  the  furnace 
location  in  which  each  of  the  16  tests  is  to  be  made.  The  four  test  symbols 
in  the  principal  design  block  that  contain  neither  e nor  f represent  tests  to 
be  made  in  location  1 in  the  furnace.  The  symbols  that  contain  e but  not  f 
represent  tests  to  be  made  in  location  2,  those  that  contain  f but  not  e repre 
sent  tests  to  be  made  in  location  3,  and  those  that  contain  both  e and  f repre 
sent  tests  to  be  made  in  location  4.  After  the  locations  are  identified,  the 
letters  e and  f are  omitted  from  all  symbols  and  the  16  tests  in  the  original 
four-factor  experiment  are  divided  among  the  four  furnace  locations  as  follows 


Location 


Tests 


1 

2 

3 

4 


(1),  ab,  acd,  bed 

c,  abc,  ad,  bd  (e  omitted) 

d,  abd,  ac,  be  (f  omitted) 
cd,  abed,  a,  b (ef  omitted) 


(177) 


If  one  of  the  design  blocks  other  than  the  principal  block  had  been  used, 
the  16  original  tests  would  be  divided  among  the  four  locations  in  the  same 
way.  For  example,  one  design  block  contains  the  tests  ef,  abef,  aedef,  and 
bedef,  and  12  other  tests.  When  the  letters  ef  are  omitted  from  these  first 
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four  symbols,  the  tests  become  (1),  ab,  acd,  and  bed  which  are  the  same  four 
tests  assigned  to  location  1 in  (177). 

The  16  tests  in  the  original  four-factor  experiment  have  been  divided 
among  four  locations  or  four  experiment  blocks.  The  usual  procedure  now  would 
be  to  make  the  tests  in  each  block  in  a random  order;  that  is,  tests  (1),  ab, 
acd,  and  bed  would  be  made  in  a random  order  in  one  block  of  tests,  then  tests 
c,  abc,  ad,  and  bd  in  a random  order  in  a second  block  of  tests,  and  so  on. 

In  the  present  example,  a block  of  tests  represents  four  tests  to  be  made  in 
a particular  furnace  location,  one  test  in  each  of  four  different  furnace  cam- 
paigns. Hie  order  in  which  the  tests  are  made  cannot  be  randomized  in  all 
locations  because  the  four  tests  in  any  one  furnace  campaign  must  all  be  made 
at  the  same  temperature.  However,  the  tests  can  be  assigned  to  furnace  cam- 
paigns in  the  following  way:  Tests  (1),  c,  d,  and  cd  are  to  be  made  in  loca- 
tions 1,  2,  3,  and  4,  respectively,  and  are  each  to  be  made  at  750°  C.  for  1 
hour.  These  four  tests  are  made  during  one  campaign.  Similarly,  tests  ab, 
abc,  abd,  and  abed  at  800°  C.  for  2 hours  are  made  during  another  campaign. 
Tests  acd,  ad,  ac,  and  a at  800°  C.  for  1 hour  are  made  during  a third  cam- 
paign, and  tests  bed,  bd,  be,  and  b at  750°  C.  for  2 hours  are  made  during  a 
fourth  campaign.  The  four  campaigns  are  conducted  in  a random  order. 

When  an  experiment  involving  higher  degrees  of  confounding  is  to  be 
designed,  it  is  helpful  to  know  that  the  set  of  defining  contrasts  is  a sub- 
group. A subgroup  is  a set  of  some  of  the  members  of  a known  group  that, 
together  with  the  operation  defined  for  the  known  group,  also  satisfies  the 
requirements  of  a group.  For  example,  the  64  coefficients  in  a six-factor, 
two-level  factorial  experiment  and  the  group  multiplication  operation  consti- 
tute a group.  Of  these  64  coefficients,  the  four  in  the  set  of  defining  con- 
trasts (171),  I,  ABCE,  ABDF,  and  CDEF,  and  group  multiplication  also  consti- 
tute a group.  The  identity  member  is  I,  each  coefficient  is  its  own  inverse, 
and  the  product  of  any  two  of  these  coefficients  also  is  one  of  the  defining 
contrasts,  for  example,  (ABCE)(ABDF)  = CDEF.  The  fact  that  the  set  of  defin- 
ing contrasts  for  any  system  of  confounding  is  a subgroup  makes  it  possible  to 
find  some  of  the  coefficients  in  the  set  of  defining  contrasts  by  group  multi- 
plication of  other  coefficients  known  to  be  in  the  set. 

The  following  example  shows  how  a high  degree  of  confounding  can  be  used 
in  the  design  of  an  experiment  for  preliminary  research.  Sometimes  a research 
problem  involves  a large  number  of  factors  that  may  influence  the  response  to 
be  investigated.  There  is  not  enough  time  to  investigate  all  factors  thor- 
oughly, so  a selection  of  the  more  important  factors  must  be  made.  A frac- 
tional factorial  design  with  a high  degree  cf  confounding  can  be  used  as  a 
guide  in  making  the  selection. 

Consider  the  flotation  of  a beryllium  mineral  from  a complex  ore  in  which 
the  following  seven  factors  may  influence  the  results.  The  first  factor,  Xa, 
is  the  addition  of  sodium  fluoride  to  the  slurry  before  conditioning  at  two 
levels,  2.25  and  4.25  pounds  sodium  fluoride  per  ton  of  solids.  The  second 
factor,  X],,  is  the  time  at  which  sodium  fluoride  is  added  to  the  slurry.  The 
two  levels  of  this  factor  are  qualitative,  and  the  sodium  fluoride  is  to  be 
added  before  or  after  grinding.  The  third  factor,  Xc,  is  the  conditioning 
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time  of  the  slurry  after  grinding  and  before  other  reagents  are  added  to  the 
slurry.  The  levels  of  Xc  are  5 and  15  minutes.  The  fourth  factor,  X<j,  is  the 
addition  of  sodium  hexametaphosphate  to  the  slurry  in  the  amounts  of  2.25  and 
2.75  pounds  per  ton  of  slurry.  The  fifth  factor,  Xe,  is  the  conditioning  time 
after  the  phosphate  salt  has  been  added  and  before  the  collector  is  added  to 
the  slurry.  The  levels  of  Xe  are  5 and  10  minutes.  The  sixth  factor,  Xf,  is 
the  addition  of  collector  to  the  slurry.  The  collector  is  a mixture  of  oleic 
acid  and  kerosene.  The  amount  of  oleic  acid  is  constant  at  1 pound  per  ton, 
but  the  amount  of  kerosene  is  to  be  studied  at  two  levels,  0.5  and  1 pound  per 
ton  of  solids.  The  seventh  factor,  Xg,  is  the  final  conditioning  time  after 
the  collector  has  been  added  and  the  levels  are  5 and  10  minutes. 

A fractional  factorial  design  for  this  experiment  will  reduce  the  number 
of  tests  needed  for  a preliminary  evaluation  of  the  seven  factors.  A full 
two-level  factorial  design  for  seven  factors  requires  128  tests,  and  if  some 
tests  are  to  be  duplicated  to  obtain  an  estimate  of  experimental  error,  the 
number  of  tests  is  larger  than  128.  A one-sixteenth  replicate  of  this  design 
contains  eight  tests  and  can  be  designed  so  that  no  two  main  effect  coeffi- 
cients are  confounded  with  each  other.  Each  coefficient  will  be  confounded 
with  15  other  aliases,  so  there  is  a chence  of  some  bias  in  the  estimates  of 
the  main  effect  coefficients.  However,  the  results  of  the  experiment  can  be 
used  as  a guide  to  select  the  factors  which  are  more  important  for  more 
thorough  investigation. 

Previous  experience  with  the  factors  under  investigation  can  be  used  in 
designing  the  fractional  factorial  experiment  so  as  to  minimize  the  chance  of 
bias  in  the  main  effect  estimates.  The  response  surface  upon  which  statisti- 
cal evaluation  of  the  data  is  based  can  be  approximated  over  a sufficiently 
small  region  by  a plane  surface  which  represents  only  linear  main  effects  of 
each  of  the  factors.  Within  this  small  region,  interaction  effects  are  less 
important  than  they  are  over  larger  regions.  Now,  consider  factor  Xa  in  the 
flotation  experiment.  The  practical  range  of  levels  for  this  factor  may  be, 
say,  0 to  10  pounds  sodium  fluoride  per  ton  of  solids.  Previous  experience 
shows  that  a difference  of  at  least  2 pounds  in  the  levels  of  this  factor  is 
necessary  before  a significant  difference  in  the  response  values  for  two  dif- 
ferent tests  can  be  obtained.  Therefore,  the  levels  in  a two- level  design 
should  be  2 pounds  apart.  A larger  spacing  increases  the  chance  for  bias  in 
main  effect  estimates  because  of  interaction  effects.  Similar  experience 
should  guide  the  spacing  of  levels  for  each  of  the  other  factors.  If  previous 
experience  is  not  available,  then  a reasonably  small  fraction  of  the  practical 
range  of  levels  should  be  used  in  preliminary  research. 

The  one-sixteenth  replicate  of  a seven-factor,  two-level  factorial  experi- 
ment is  designed  as  follows:  Each  main  effect  coefficient  is  confounded  with 
3 two-factor  interaction  effect  coefficients  and  12  higher  order  coefficients 
by  confounding  A with  BE,  CF,  and  DG  in  one  set  of  aliases  and  B with  DF  in 
another  set;  that  is, 


A = BE  = CF  = DG,  and  B = DF 


(178) 
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Four  coefficients  in  the  set  of  defining  contrasts  are  determined  by  the 
aliases  in  set  (178).  These  are 

ABE,  ACF,  ADG,  BDF.  (179) 

The  remaining  11  coefficients  in  the  set  are  found  by  making  use  of  the  fact 
that  the  coefficients  in  the  set  of  defining  contrasts  and  group  multiplica- 
tion constitute  a subgroup.  Since  this  is  true,  the  remaining  11  coefficients 
can  be  found  as  products  of  the  4 coefficients  in  set  (179).  Six  coefficients 
are  obtained  as  products  of  pairs  of  the  coefficients  in  set  (179): 

BCEF,  BDEG,  ADEF,  CDFG,  ABCD,  ABFG.  (180) 

Four  more  are  found  as  products  of  three  of  the  coefficients  in  set  (179): 

ABCDEFG,  CDE,  EFG,  BCG.  (181) 

The  last  coefficient  is  obtained  as  the  product  of  all  four  coefficients  in 
set  (179). 


ACEG.  (182) 

The  15  coefficients  in  sets  (179),  (180),  (181),  and  (182)  and  the  coefficient 
I are  the  coefficients  in  the  set  of  defining  contrasts  for  a one-sixteenth 
replicate  design. 

There  are  eight  tests  in  the  principal  block  for  this  design,  and  three 
independent  tests  are  found  by  comparison  of  the  tests  in  the  seven-factor, 
two-level  design  with  each  of  the  15  coefficients  in  sets  (179),  (180),  (181), 
and  (182).  The  remaining  four  tests  are  found  as  products  of  the  three 
independent  tests,  and  the  tests  in  the  principal  block  are 

(1),  abed,  abfg,  aceg,  cdfg,  bdeg,  beef,  adef.  (183) 

There  are  15  other  blocks  of  eight  tests  that  may  be  obtained,  and  any  one  of 
these  or  the  principal  block  (183)  may  be  used  to  obtain  estimates  of  the  main 
effect  coefficients  A,  B,  C,  D,  E,  F,  and  G,  as  well  as  the  average  effect 
coefficient  I.  If  there  were  no  other  estimate  of  experimental  error  avail- 
able, the  eight  tests  in  the  block  being  selected  should  be  duplicated  to 
obtain  an  experimental  error  estimate  for  comparison  with  each  of  the  main 
effect  estimates. 

The  experiment  based  upon  the  one-sixteenth  replicate  design  in  the  pre- 
ceding discussion  is  the  first  step  in  the  investigation  of  the  seven  factors. 
The  next  step  is  determined  by  the  results  of  the  first  experiment.  One  possi- 
ble result  is  for  the  main  effect  estimates  of  three  of  the  factors  to  be  sig- 
nificantly greater  than  the  other  four  main  effect  estimates.  Suppose  the 
three  factors  are  X^,  addition  of  sodium  hexametaphosphate  to  the  slurry;  Xe, 
the  conditioning  time  after  the  phosphate  salt  has  been  added;  and  Xg,  the 
final  conditioning  time  after  collector  has  been  added.  Then  subsequent  exper- 
iments would  be  planned  to  investigate  the  main  and  interaction  effects  of 


these  three  factors  in  greater  detail.  For  the  subsequent  tests  the  other 
four  factors  would  be  held  at  constant  levels. 

Another  possible  result  of  the  data  obtained  from  the  one-sixteenth 
replicate  experiment  is  for  all  main  effect  estimates  to  be  sufficiently  large 
to  require  further  investigation  before  any  of  the  factors  are  eliminated.  In 
this  case,  the  design  can  be  expanded  to  a one-eighth  replicate  design  by 
making  eight  additional  tests.  These  eight  tests  can  be  determined  by  the 
following  considerations:  It  generally  is  assumed  that  higher  order  interac- 
tion effects  will  be  less  significant  than  those  of  lower  order.  Therefore, 
the  next  step  will  be  to  design  the  experiment  so  as  to  separate  main  effect 
coefficients  from  two-factor  interaction  effect  coefficients.  The  procedures 
used  here  were  introduced  in  the  section  on  defining  contrasts  and  the  princi- 
pal block.  Suppose  the  principal  block  of  tests  (183)  had  been  made  in  the 
first  experiment.  The  responses  from  these  eight  tests  can  be  combined  alge- 
braically to  obtain  eight  sums  of  aliases.  One  of  these  will  be  the  sum  of 
defining  contrasts  in  which  even  coefficients  are  positive  and  odd  coeffi- 
cients are  negative;  that  is, 

I - ( 1/8)ABE  - ( 1/ 8)ACF  - (1/8)ADG  - (1/8)BDF  + (1/16)BCEF 
+ ( 1/ 16)BDEG  + ( 1/ 16)ADEF  + (1/16)CDFG  + (1/16)ABCD 
+ ( 1/ 16)ABFG  - ( 1/ 128)ABCDEFG  - (1/8)CDE  - (1/8)EFG 
- ( 1/8)BCG  + (1/16)ACEG.  (184) 

From  this  sum  (184)  the  sums  containing  each  of  the  main  effect  coefficients 
can  be  obtained;  for  example,  the  sum  containing  the  coefficient  A is 

A - ( 1/ 2)BE  - ( 1/2)CF  - ( 1/2)DG  - (1/8)ABDF  + (1/16)ABCEF 
+ ( 1/16)ABDEG  + ( 1/4)DEF  + (1/16)ACDFG  + (1/4)BCD  + (1/4)BFG 
- ( 1/ 32)BCDEFG  - (1/8)ACDE  - (1/8)AEFG  - ( 1/8)ABCG  + (1/4)CEG.  (185) 

For  the  present  discussion,  attention  is  focused  upon  main  effects  and 
two-factor  interaction  effects  such  as  the  first  four  terms  in  sum  (185); 
that  is,  A - (1/2)BE  - (1/2)CF  - (1/2)DG.  The  eight  additional  tests  to  be 
made  must  constitute  a one-sixteenth  replicate  experiment  which  will  yield  a 
sum  whose  first  four  terms  are  A + (1/2)BE  + (1/2)CF  + (1/2)DG.  These  tests 
are  found  by  finding  the  nonprincipal  block  with  the  sum  of  defining  contrasts 
similar  to  the  sum  (184)  but  in  which  all  terms  are  positive.  The  block  con- 
taining the  test  abe  yields  the  desired  sum.  Each  term  in  sum  (184)  which 
contains  one  of  the  letters  A,  B,  or  E is  multiplied  by  (-1);  each  term  con- 
taining two  of  the  letters  is  multiplied  by  (-1)2  ; and  each  term  containing 
all  three  letters  is  multiplied  by  (-1)3.  The  other  tests  are  found  by  multi- 
plying each  test  in  the  principal  block  (183)  by  abe.  The  eight  additional 
tests  are 


abe,  cde,  efg,  beg,  abedefg,  adg,  acf,  bdf.  (186) 

The  expanded  design  consisting  of  the  tests  in  the  principal  block  (183)  and 
the  block  (186)  constitutes  the  principal  block  of  a one-eighth  replicate 
design  in  which  all  main  effect  coefficients  are  separated  from  two-,  four-. 
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and  six-factor  interaction  effect  coefficients.  Main  effects  still  will  be 
confounded  with  three-  and  five-factor  interaction  effects.  The  assumption 
that  higher  order  interaction  effects  are  less  significant  than  lower  order 
interaction  effectsmeans  that  the  main  effect  estimates  obtained  from  the  one- 
eighth  replicate  design  can  be  compared  and  the  more  important  factors 
selected  with  greater  confidence.  At  this  point,  the  more  important  factors 
may  be  studied  in  greater  detail,  or  the  one-eighth  replicate  design  can  be 
expanded  to  a one-quarter  replicate  design  by  making  16  additional  tests. 

Other  blocks  than  the  principal  blocks  of  the  one-sixteenth  replicate 
design  can  be  expanded  to  separate  main  effect  coefficients  from  two-factor 
interaction  effect  coefficients.  For  example,  one  block  contains  the  test  a. 
The  other  tests  in  this  block  are  found  by  multiplying  each  of  the  tests  in 
the  principal  block  (183)  by  letter  a to  obtain 

a,  bed,  bfg,  ceg,  aedfg,  abdeg,  abcef,  def.  (187) 

The  eight  additional  tests  to  expand  this  block  to  a one-eighth  replicate 
design  are  found  by  multiplying  each  of  the  tests  (186)  also  by  letter  a to 
obtain 

be,  aede,  aefg,  abeg,  bedefg,  dg,  cf,  abdf.  (188) 

The  results  of  the  16  tests  in  sets  (187)  and  (188)  will  yield  estimates  of 
the  main  effects  separated  from  the  two-factor  interaction  effects. 

In  the  case  of  factors  which  do  not  interact  with  each  other,  a frac- 
tional factorial  design  with  a high  degree  of  confounding  can  be  used  directly 
to  obtain  estimates  of  the  main  effects.  It  may  be  desired  to  evaluate  the 
effects  of  several  qualitative  or  minor  factors  upon  a given  response.  Such 
factors  include  the  performance  of  different  men  or  different  machines,  the 
making  of  tests  on  different  days,  and  so  on.  When  several  such  factors  are 
to  be  tested,  the  smallest  fractional  factorial  design  which  gives  separate 
estimates  of  all  main  effects  is  used. 

Confounding  in  Factorial  Designs  Having  More  Than 
Two  Levels  per  Factor 

As  more  levels  per  factor  are  included  in  a design,  a more  complete  evalu- 
ation of  the  effects  of  the  factors  can  be  made  over  a given  range  of  values 
for  each  factor.  If  some  of  the  higher  order  components  of  the  effects  are 
not  important,  it  is  still  desirable  to  estimate  the  lower  order  components 
over  a larger  number  of  levels  per  factor.  In  this  case  a full  factorial 
design  may  require  more  tests  and  give  more  information  than  necessary.  The 
general  procedure  for  confounding,  \*hich  has  been  introduced  in  terms  of  the 
sisg>ler  two- level  designs,  can  be  extended  for  the  design  of  experiments 
having  uiore  than  two  levels  per  factor  so  as  to  reduce  the  number  of  tests  and 
yet  permit  the  evaluation  of  the  important  effects  over  the  desired  number  of 
levels.  The  general  procedure  has  the  advantage  that  it  can  be  used  in  more 
extensive  designs,  whereas  the  method  based  upon  a table  of  signs  cannot. 
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An  example  in  which  the  principles  of  confounding  are  useful  in  a design 
involving  more  than  two  levels  per  factor  is  the  following:  There  are  three 
factors  to  be  studied,  each  at  four  levels.  In  this  case,  a full  factorial 
design  requires  64  tests.  An  experiment  based  upon  this  design  will  yield 
data  from  which  to  calculate  the  coefficients  I,  A,  A2,  A3,  B,  B2  , B3  , C,  C2  , 

C3 , and  54  interaction  coefficients.  It  may  be  known  that  only  linear  and 
quadratic  components  of  the  main  effects  and  the  lower  order  components  of  the 
two-factor  interaction  effects  are  important.  A one-fourth  replicate  of  the 
full  four-level  design  contains  16  tests  and  can  be  used  to  calculate  I,  A, 

A2  , B,  B2  , C,  C2  , AB,  AB2  , A2B,  AC,  AC?,  A2C,  BC,  BC2  , and  B2C.  The  set  of 
defining  contrasts  for  the  design  contains  the  aliases  I,  ABC,  A2B2C2,  and 
A3B3C3.  For  comparison,  a full  three-factor,  three-level  factorial  design 
requires  27  tests,  or  11  more  tests  than  are  necessary  if  the  coefficients 
A2B2  , A2C?,  B2C2,  and  the  various  components  of  the  three-factor  interaction 
effect  are  not  important.  The  one-fourth  replicate  of  the  four-level  design 
also  has  the  advantage  of  including  four  levels  for  each  factor  in  the  inves- 
tigation. As  the  number  of  levels  per  factor  increases,  the  possibility  of 
reducing  the  number  of  tests  by  means  of  a fractional  replicate  of  a full 
design  deserves  increasing  attention. 

The  general  procedure  for  confounding  effects  in  fractional  factorial 
designs  now  will  be  applied  to  designs  in  which  each  factor  is  studied  at 
three  levels.  Three-level  factorial  experiments  are  designed  to  estimate 
linear  and  quadratic  components  of  the  main  effect  of  each  factor  and  the  cor- 
responding components  of  the  interaction  effects.  Such  designs  cannot  be  used 
to  estimate  cubic  components  of  the  main  effects.  The  general  procedure  for 
confounding  in  three-level  designs  is  based  upon  the  fact  that  coefficients  of 
cubic  components  of  main  effects  are  confounded  with  the  average  effect  coeffi- 
cient I.  This  is  expressed  in  group  theory  notation  as  follows: 

A3  = B3  = C3  = D3  = . . . = I.  (189) 

This  corresponds  to  the  similar  expression  (158)  for  two- level  designs  in 
which  quadratic  component  coefficients  A2,  B2 , and  so  forth  are  confounded 
with  I.  The  coefficients  for  which  a three-level  factorial  experiment  is 
designed  are  multiplied  together  in  the  procedure  for  confounding  effects. 

Because  of  the  relationship  (189),  the  symbolic  product  of  any  set  of  the 
coefficients  which  are  estimated  by  a three-level  factorial  experiment  also 
will  be  one  of  these  coefficients.  For  example,  (A2BC)(AB)  = A3B2C,  but  A3  is 
confounded  with  I which  serves  as  the  identity  and  is  omitted  from  any  product 
unless  it  stands  alone.  Therefore,  (A3BC)(AB)  = B2C.  Another  example  product 
is  the  following:  (AB2C)(B2(?)  = AB3BC3  = AB.  The  set  of  coefficients  for  a 
three-level  factorial  design  and  group  multiplication  satisfies  the  mathemati- 
cal requirements  for  a group.  Products  are  members  of  the  set,  the  identity 
member  is  I,  and  the  inverse  of  each  member  is  the  square  of  that  member;  for 
example,  (A)(A2)  = A3  =1;  also  (AB2C)2  = A2B4C2  = A2BC?  , since  B4  = B3B  = IB 
= B,  and  (AB2C)(A2BC? ) = I3  = I. 

Confounding  in  three-level  designs  will  be  illustrated  by  means  of  an 
experiment  to  study  three  factors,  Xa,  Xj,,  and  Xj,.  The  least  number  of 


coefficients  in  a subgroup,  and  therefore  in  a set  of  defining  contrasts,  for 
a three-level  design  is  three.  One  may  choose  I and  any  other  coefficient, 
say  A2B,  to  be  in  the  subset.  The  third  member  is  the  inverse  of  the  second 
coefficient;  in  this  case,  (A2B)2  = AB2 . There  are  13  possible  subgroups  in 
this  example  which  contain  three  coefficients,  and  each  of  the  27  coefficients, 
I,  A,  A2 , ...,  A2B2C2,  will  be  in  one  of  these  subgroups.  These  13  subgroups 
are  shown  in  table  7 together  with  conventional  symbols,  which  have  been 
adopted  in  the  literature11  for  more  convenient  reference  to  the  subgroups  in 
subsequent  discussion. 


TABLE  7.  - The  13  subgroups  of  three  coefficients  which  may  be  obtained 
from  the  group  of  27  coefficients  calculated  from  the  data 
of  a full  three-factor,  three- level  factorial  design 


Subgroup 

Corresponding 

symbol 

Subgroup 

Corresponding 

symbol 

I,  A,  A2 

A 

I,  B2C,  BC2 

I(BC) 

I,  B,  B2 

B 

I,  BC,  B2^ 

J(BC) 

I,  C,  CP 

C 

I,  A2BC,  AB2^ 

W(ABC) 

I,  A2B,  AB2 

I(AB) 

I,  AB2 C,  A2BC2 

X(ABC) 

I,  AB,  A2B2 

J(AB) 

I,  ABC2,  A2B2C 

Y(ABC) 

I,  A2C,  AC2 

I(AC) 

I,  ABC,  A2B2C3 

Z(ABC) 

I,  AC,  A2C® 

J(AC) 

A three-member  set  of  defining  contrasts  may  be  used  to  design  a one- 
third  replicate  of  a full  three- level  design.  The  most  useful  of  the  sub- 
groups in  table  7 to  be  selected  as  defining  contrasts  are  W(ABC) , X(ABC) , and 
Y(ABC).  The  data  from  an  experiment  based  on  one  of  these  subgroups  can  be 
used  to  calculate  both  linear  and  quadratic  component  coefficients  of  all 
three  factors  and  two  of  the  three  lowest  order  two-factor  interaction  compo- 
nent coefficients;  that  is,  I,  A,  A2,  B,  B2 , C,  C2 , and  two  of  the  three  coef- 
ficients AB,  AC,  and  BC.  Such  an  experiment  consists  of  9 tests,  whereas  the 
full  three-level  design  consists  of  27  tests. 

The  other  subgroups  in  table  7 generally  are  less  useful  but  may  be  of 
value  in  particular  applications.  The  subgroup  Z(ABC)  may  be  used  to  calcu- 
late all  main  effect  coefficients,  but  the  lowest  order  interaction  coeffi- 
cients AB,  AC,  and  BC  are  confounded  with  C2 , B2 , and  A2,  respectively.  The 
subgroup  J(AB)  confounds  B2  with  A and  A2  with  B;  therefore,  this  subgroup 
would  be  used  only  if  A2  and  B2  were  known  to  be  negligible,  and  it  were 
desired  to  evaluate  A and  B over  three  levels  for  each  corresponding  factor. 
Both  C and  C2  can  be  obtained,  and  two  components  of  each  of  the  interaction 
effects  between  Xa  and  Xc  and  between  Xb  and  Xc  can  be  estimated;  that  is,  AC, 
AC2,  BC,  and  BC2  can  be  calculated.  Similar  results  are  obtained  with  sub- 
groups J(AC)  and  J(BC).  The  subgroups  I(AB) , I(AC) , I(BC),  A,  B,  and  C cause 
some  main  effect  coefficients  to  be  confounded  either  with  each  other  or  with 
the  average  effect  coefficient  I and  are  of  little  value  in  designing  frac- 
tional factorial  experiments. 

11  Davies,  0.  L. , Design  and  Analysis  of  Industrial  Experiments:  Hafner  Pub- 
lishing Co.,  New  York,  N.Y. , 1956,  p.  426. 
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The  following  example  illustrates  the  use  of  one  of  the  subgroups  W(ABC) , 
X(ABC) , or  Y(ABC)  to  design  a one-third  replicate  of  a three-factor,  three- 
level  factorial  experiment.  Two  factors  are  to  be  studied  in  a filtration 
experiment  to  estimate  their  effects  upon  the  time  required  to  filter  a unit 
weight  of  a flotation  concentrate.  Factor  Xa  is  the  amount  of  flocculant 
added,  and  the  levels  are  0,  0.5,  1.0  pound  per  ton  of  concentrate.  Factor  Xb 
is  the  type  of  filter  cloth  used,  and  three  types  are  included  in  the  investi- 
gation. In  addition,  it  is  desired  to  compare  the  performances  of  three  men 
who  are  being  trained  to  make  the  tests  for  a more  extensive  investigation. 

The  men  are  regarded  as  three  levels  of  a personnel  factor,  Xc.  The  only 
interaction  effect  expected  to  exist  is  that  between  factors  Xa  and  Xb,  and 
only  the  lowest  order  component  coefficient,  AB,  is  expected  to  be  important. 

A system  of  confounding  based  upon  subgroup  W(ABC)  has  the  interaction  effects 
coefficient  BC  confounded  with  the  main  effect  coefficient  A.  Otherwise,  the 
main  effect  coefficients  and  the  two  interaction  coefficients  AB  and  AC  are 
confounded  only  with  unimportant  higher  order  interaction  effect  coefficients. 

A system  of  confounding  based  upon  subgroup  X(ABC)  has  AC  confounded  with  B, 
and  a system  based  upon  subgroup  Y(ABC)  has  AB  confounded  with  C.  In  the 
present  example,  AB  must  not  be  confounded  with  a main  effect  coefficient. 
Therefore,  either  subgroup  W(ABC)  or  X(ABC)  may  be  selected  for  the  design  of 
a one-third  replicate  experiment.  The  subgroup  X(ABC)  is  selected  for  this 
example. 

The  system  of  confounding  for  this  experiment  is  determined  in  a way  simi- 
lar to  that  for  two-level  factorial  designs.  Each  coefficient  in  the  set  of 
defining  contrasts  is  multiplied  by  A,  then  by  B,  and  so  on,  until  all  27 
coefficients  have  been  located  in  a set  of  three  aliases.  With  subgroup  X(ABC) 
as  the  set  of  defining  contrasts,  the  complete  system  of  confounding  is  as 
follows : 


I, 

AB2  C, 

A2  BC2  ; 

A, 

A2B2C, 

BC2  ; 

B, 

AC, 

A2  B2  C2  ; 

c. 

AB2  C2  , 

A2B; 

A2, 

B2C, 

ABC?  ; 

B2, 

ABC, 

A2  C2, 

c2. 

AB2  , 

A2BC; 

AB, 

A2C, 

B2C?  ; 

BC, 

AC2  , 

A2B2. 

The  tests  in  the  principal  block  for  this  system  of  confounding  are  found 
by  comparison  with  the  defining  contrasts.  For  three-level  designs,  test 
aybyCx. ,.kz  is  in  the  principal  block  if  the  comparison  between  that  test  and 
each  of  the  defining  contrasts  APB9cr...Ku  gives  a sum  equal  to  zero  or  a mul- 
tiple of  3;  that  is, 

pv  + qw  + rx  + . . . + uz  = 3n,  n = 0,  1,  2,  ...  . (191) 
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Here,  p,  q,  r,  u,  v,  w,  x,  ....  z can  equal  0,  1,  or  2.  As  in  the  case 

of  two-level  factorial  designs,  the  tests  in  a full  three-factor,  three-level 
design  are  compared  with  the  defining  contrasts  in  alphabetical  order,  taking 
tests  represented  by  one  letter  first,  two  letters  second,  and  so  on.  Two 
independent  tests  in  the  principal  block  are 

aibi»  aaS  * <192) 

The  remaining  six  tests,  other  than  (1),  can  be  found  as  indicated  above,  or 
the  two  tests  (192)  can  be  used  as  follows  to  find  them: 


When  multiplying  test  symbols  for  designs  having  three  or  more  levels, 
the  subscripts  are  treated  as  powers;  for  example,  (a)  (aj  ) = (a1  )(ax  ) = 
a2  = a2 . Group  theory  multiplication  of  test  symbols  in  three- level  factorial 
designs  is  defined  so  that 


= b3  = 


= d„ 


1. 


(193) 


As  a second  example, 

(a2c1)(asb2)  = a4baca  =a3a1b2c1  =a1bac1. 

The  remaining  six  tests  in  the  principal  block  for  the  present  example  are 
found  by  taking  all  possible  products  of  the  two  tests  (192)  and  their  squares; 
that  is, 

(c^bi)3  = aab3, 

<asci  >2  = ca  > 

<aibi  ><aaci  > = bici  ’ (194) 

(a1  bj  )s(a2c1  ) = a2  bgCj  , 

(%  \ Kasci  )2  = a2b!  c3  > 

(ai  bi  )2(asCi  )2  = Vs- 

Therefore,  the  nine  tests  in  the  principal  block  for  a one-third  repli- 
cate of  a full  three-factor,  three-level,  factorial  design  based  upon  the  sub- 
group X(ABC)  are 

(1),  a1  b:  , agt^  , a2b2  , a1  cs  , bx  c:  , b2c2,  a1  bgCj  , andag^Cg.  (195) 

The  other  two  design  blocks  for  the  same  system  of  confounding  are  found  by 
multiplying  the  tests  in  the  principal  block  (195)  by  a:  , and  then  by  bj  to 
obtain 

*4  » asbi  » ci  » ba  » aaca  » *4.  bi  » *4  Va  » aaVi  » bi  ca 

and 

bi  » a2  bj  Cj  , a2  , a1  bj  c2  , b2  Cj  , c2  , ^ cx  , a2  b2  c2 . 


(196) 

(197) 


The  usefulness  of  the  above  procedure  in  finding  the  tests  in  the  principal 
and  other  design  blocks  is  the  reason  for  the  notation  which  has  been  adopted 


i 


I 

! 

t 

I 


I 

i 


i 
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for  the  tests  in  a factorial  design;  that  is,  a^_5  corresponding  to  response 
y^,  test  Cfc-!  corresponding  to  response  yijk»  and  test  (1)  correspond- 

ing to  response  y j. 

The  tests  in  any  one  of  the  blocks  (195),  (196),  or  (197)  can  be  made  to 
obtain  data  from  which  to  calculate  the  coefficients  of  the  linear  and  quad- 
ratic components  of  the  main  effects  of  all  three  factors  and  the  two  interac- 
tion coefficients  AB  and  BC,  provided  all  other  interaction  coefficients  are 
negligible.  In  the  case  where  some  of  the  other  coefficients  are  not  negligi- 
ble, it  is  not  possible,  as  it  was  with  two-level  experiments,  to  calculate 
algebraic  sums  for  each  set  of  aliases  shown  in  the  set  (190)  which  are  appli- 
cable to  all  responses  obtained  in  the  experiment.  The  reason  for  this  is: 

The  response  function  for  the  three-factor,  three- level  experiment  under 
consideration  is 

^ijk  = IZooo(ijk)  + AZaoo(ijk)  + A2zaoo(ijk)  + BZoio(ijk) 

+ . . . + AB2CZiai  (ijk)  + • • • + A2  BCSZS1  g ^ t jkj 

+ . . . + A2B2C2Zs?s(ijk).  (198) 

In  the  system  of  confounding  for  the  present  example,  the  coefficients  I, 

AB^C,  and  A2B<?  constitute  one  set  of  aliases.  For  any  given  response  y^jk* 
the  effects  in  equation  (198)  corresponding  to  these  aliases  are  confounded 
and  may  be  grouped  as  follows: 

IZooo(ijk)  + AB3cZi2i  (ijk)  + ASB<?Zsia(ijk)*  (199) 

The  Z polynomials  in  equations  (198)  and  (199)  for  a three- level  experiment 
vary  for  the  same  response  and  also  vary  from  response  to  response  in  absolute 
magnitude  as  well  as  in  sign.  Consequently,  similar  sums  of  the  same  compo- 
nents such  as  the  sum  (199)  do  not  have  a common  absolute  polynomial  value 
which  can  be  factored  out  of  the  sums  for  all  responses,  leaving  a sum  of 
aliases  common  to  each  response. 

A one-third  replicate  of  a three-level  experiment  may  be  designed  by 
selecting  any  two  coefficients  to  be  confounded,  instead  of  selecting  one  of 
the  subgroups  in  table  7 as  has  been  illustrated  in  the  filtration  experiment. 
One  may  know  that  a particular  interaction  coefficient  is  negligible  and  wish 
to  confound  it  with  one  of  the  main  effect  coefficients.  Suppose  it  is  desir- 
able to  confound  C2  with  AB^C.  The  defining  contrasts  then  are  found  by  multi- 
plying each  of  these  coefficients  by  C;  that  is,  C3  = AB2C32  or  I = AB2^.  The 
third  defining  contrast  is  (AB2^)2  = A2BC.  This  set  is  the  subgroup  W(ABC). 
From  here,  the  complete  system  of  confounding  and  the  tests  in  each  block  are 
found  as  shown  in  the  last  example.  If  two  coefficients  are  selected  as 
aliases,  the  system  of  confounding  should  be  completed  to  determine  whether 
any  important  coefficients  are  aliases  of  each  other  as  a result  of  the  ini- 
tial selection. 

Higher  degrees  of  confounding  can  be  used  in  three-level  designs  as  well 
as  in  two- level  designs.  In  such  cases  it  usually  is  necessary  that  a larger 
number  of  factors  be  involved  and  that  only  main  effects  of  the  factors  be 
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important.  The  coefficients  in  a three-level  design  can  be  listed  in  sub- 
groups in  which  the  number  of  coefficients  is  a power  of  3;  that  is,  3,  9,  27, 
and  so  on.  Any  one  of  these  subgroups  may  be  selected  as  the  set  of  defining 
contrasts  from  which  to  complete  a system  of  confounding  and  find  the  tests  in 
the  principal  and  other  design  blocks. 

The  principles  of  confounding  are  readily  extended  to  factorial  designs 
having  any  number  of  levels  per  factor.  If  each  factor  were  studied  at  four 
levels,  the  group  multiplication  would  be  defined  so  that 

A4  = B4  = C4  * ...  = 1.  (200) 

Any  coefficient  in  a full  four-level  design  can  be  placed  in  a subgroup  in 
which  the  number  of  coefficients  is  a power  of  4;  that  is,  4,  16,  64,  etc. 

There  are  additional  subgroups  in  the  case  of  four-level  designs  in  which 
some  but  not  all  of  the  coefficients  can  be  placed;  for  example,  there  are. 
some  subgroups  containing  two  coefficients.  One  coefficient  is  the  average 
effect  coefficient  I,  which  serves  as  the  identity  element  and  must  be  in  all 
subgroups.  If  there  are  three  factors  to  be  studied,  the  other  coefficient  in 
a subgroup  containing  two  coefficients  can  be  one  of  the  following: 

A2,  B2,  C2,  A2B2  , A2  , Bs(f,  or  A^C2.  (201) 

Additional  special  subgroups  can  be  found  in  which  the  number  of  coefficients 

is  a power  of  2,  other  than  those  already  mentioned  in  which  the  number  is  a 

power  of  4.  For  example,  the  coefficients  of  set  (201)  together  with  I con- 
stitute a subgroup  of  eight  coefficients.  Any  subgroup  can  be  selected  as  a 
set  of  defining  contrasts.  A test  a^^x***^  is  in  the  principal  block  if 
the  comparison  between  that  test  and  each  of  the  defining  contrasts  aPB^C1".  . . Ku 
gives  a sum  equal  to 

pv  + qw  + rx  + . . . + uz  = 4n,  n = 0,  1,  2 (202) 

where  p,  q,  r,  ...,  u,  v,  w,  x,  ...»  z can  equal  0,  1,  2,  or  3.  Again,  ap  is 

regarded  as  aP  and 

a4  = b4  = c4  = d^  = ...  = 1.  (203) 

With  these  relationships,  the  tests  in  the  principal  and  other  design  blocks 
may  be  found. 

In  general,  if  each  factor  is  studied  at  m levels,  the  group  multiplica- 
tion is  defined  so  that 


Am  = Bm  = C™  = Dm  = ...  =1.  (204) 

Any  coefficient  in  a full  m- level  design  can  be  placed  in  a subgroup  in  which 
the  number  of  coefficients  is  a power  of  m,  that  is,  m,  n?  , nt3  , and  so  on.  In 
certain  cases,  a subgroup  may  contain  some  other  number  of  coefficients.  For 
example,  if  in  * 6,  the  two  coefficients  I and  AaB3  form  a subgroup,  and  also 
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the  three  coefficients  I,  A2B4(?,  and  A4B2C4  form  a subgroup.  A test  is  in 
the  principal  block  if  the  appropriate  sums  are  equal  to 

pv  + qw  + rx  + ...  + uz  = mn,  n = 0,  1,  2,  ...»  (205) 

where  p,  q,  r,  ...,  u,  v,  w,  x,  ...,  z can  equal  0,  1,  2,  ...»  (m  - 1). 

Finally, 

*m  - bm  " cm  = ^ = •••  = 1-  <206> 

These  relationships  may  be  used  to  design  any  fractional  factorial  experiment 
in  which  each  factor  is  studied  at  m levels. 

It  also  is  possible  to  apply  the  principles  of  confounding  to  designs  in 
which  the  factors  are  not  all  studied  at  the  same  number  of  levels.  Confound- 
ing in  such  designs  is  complicated,  the  usefulness  is  limited,  and  Davies12 
recommends  that  it  be  avoided  if  possible.  Additional  information  on  confound- 
ing may  be  found  in  the  book  by  Davies. 

As  in  two-level  designs,  estimates  of  coefficients  for  an  m- level  design 
will  be  biased  if  main  effect  coefficients  of  order  m or  larger  and  correspond- 
ing interaction  effect  coefficients  are  important.  If  such  a higher  order 
coefficient,  A^B^C1  ...Ku,  is  important,  the  lower  order  alias  may  be  deter- 
mined by  dividing  each  power,  p,  q,  r,  ...,  u,  by  m.  The  remainders  obtained 
will  be  the  respective  powers  of  A,  B,  C,  . . . , K in  the  alias. 

Latin  Squares.  - There  is  frequent  reference  in  the  literature  to  l^tin 
squares.  A Latin  square  is  simply  one  type  of  fractional  factorial  design  and 
is  used  to  study  three  factors  which  usually  are  qualitative  and  do  not  inter- 
act. If  three  levels  are  studied  for  each  factor,  the  Latin  square  is  a one- 
third  replicate  of  a full  three-factor  design;  if  four  levels,  then  it  is  a 
one-fourth  replicate;  if  five  levels,  a one-fifth  replicate;  and  so  forth. 

The  construction  of  a Latin  square  gives  a simple  method  for  finding  the 
tests  in  the  corresponding  full  factorial  design  which  are  to  be  made.  How- 
ever, this  procedure  does  not  tell  how  the  effects  are  confounded,  and  there- 
fore, it  should  be  used  only  when  interactions  between  the  factors  are  known 
to  be  nonexistent  or  negligible. 

The  following  example  illustrates  the  type  of  experiment  that  may  employ 
a Latin  square  design.  A preliminary  investigation  of  manganese  flotation  is 
to  be  made.  One  purpose  of  this  investigation  is  to  determine  whether  to 
deslime  the  ore  before  flotation.  Factor  Xa  is  the  type  of  ore,  and  the  three 
levels  studied  are  undeslimed  ore  and  the  two  fractions,  sands  and  slimes. 
Factor  is  the  type  of  flotation  cell  to  use,  and  three  levels  to  be  studied 
are  Denver,  Fagergren,  and  Booth  cells.  Factor  Xc  is  a personnel  factor. 

This  experiment  is  the  first  step  in  a more  extensive  investigation  of  manga- 
nese flotation,  and  three  men  are  being  trained  to  make  the  tests.  Therefore, 


12Davies,  0.  L. , Design  and  Analysis  of  Industrial  Experiments:  Hafner  Pub- 
lishing Co.,  New  York,  N.Y. , 1956,  pp.  367-494. 


an  additional  purpose  of  this  experiment  is  to  compare  the  performance  of  the 
three  men.  Interactions  among  these  factors  are  not  expected  to  be  important, 
so  a Latin  square  design  can  be  used  to  reduce  the  number  of  tests  required 
from  27  tests  for  a full  factorial  design  to  9 tests.  A Latin  square  design 
may  include  the  following  nine  tests: 

(1)  blCl 

b2  C2 

®1  C3  bl 

(207) 

a2Cl  a2blC2 

a2b2 

This  arrangement  is  obtained  by  first  listing  the  tests  in  a full  design  for 
the  two  factors  Xa  and  Xfe  in  such  a way  that  the  rows  represent  the  first, 
second,  and  third  levels  of  factor  Xa  and  the  columns  represent  the  first, 
second,  and  third  levels  of  factor  Xb*  Then  the  letter  c is  added  to  the  test 
symbols  in  such  a way  that  each  subscript  appears  once  in  each  row  and  in  each 
column.  By  rearranging  the  order  of  the  subscripts  of  the  letter  c in  the 
rows  and  columns,  11  other  Latin  square  designs  are  possible  for  this  particu- 
lar experiment. 

The  characteristic  property  of  the  Latin  square 
list  just  the  subscripts  of  the  letter  c in  the  same 

is  more  apparent  if  we 
array : 

0 1 

2 

2 0 

1 

(208) 

1 2 

0 

Another  possible  arrangement  of  the  subscripts  is 

0 1 

2 

1 2 

0 

(209) 

2 0 

1 

This  corresponds  to  a Latin  square  design 
includes  the  following  tests: 

for  the  same  experiment  which 

(1)  blCl 

b2C2 

*4  ci  ^ bi  ca 

*iba 

(210) 

a2C2  as\ 

Greco-Iatin  squares  are  used  to  study  four  qualitative  factors  which  do 
not  interact.  A three- level  Greco- Latin  square  is  a one-ninth  replicate  of  a 
full  four-factor  design.  The  corresponding  four-level  design  is  a one- 
sixteenth  replicate.  In  general,  an  m-level  Greco-Latin  square  contains  (l/m)P 
times  the  total  number  of  tests  in  a full  four-factor,  m-level  factorial 
design. 
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A Greco- Latin  square  may  be  constructed  by  selecting  two  Latin  squares 
having  the  property  that  when  they  are  superimposed  upon  each  other,  each  sub- 
script in  one  square  appears  once  with  every  subscript  in  the  other  square. 

Two  Latin  squares  that  have  this  property  are  said  to  be  orthogonal  to  each 
other;  for  example,  squares  (208)  and  (209)  are  orthogonal  and  may  be  super- 
imposed to  give 


0,0  1,1  2,2 

1,2  2,0  0,1  (211) 

2,1  0,2  1,0 

The  first  subscript  in  each  pair  is  obtained  from  the  Latin  square  (209)  and 
the  second  subscript  is  from  the  Latin  square  (208).  Now,  let  the  first  sub- 
scripts be  those  of  the  letter  c and  the  second  subscripts  be  those  of  the 
letter  d.  The  array  (211)  then  represents  the  following  nine  tests: 

(1)  ^c^  bscsds 

^ bi  c2  / (212) 

asbl  d2  asbSCl 

This  method  of  constructing  the  above  square  gives  a design  from  which  the 
main  effects  of  all  four  factors  can  be  estimated. 


As  the  number  of  levels  per  factor  increases,  it  is  possible  to  find  more 
than  two  Latin  squares  which  are  mutually  orthogonal  to  each  other.  Such 
squares  form  an  orthogonal  set.  To  illustrate  the  usefulness  of  such  squares, 
consider  a metallurgical  process  which  involves  four  sources  of  ore,  Xa;  four 
sources  of  reagent,  X^;  four  men,  Xc;  operating  four  similar  sets  of  equipment, 
XdJ  and  four  analysts,  Xe,  assaying  the  products  of  the  process.  A full  facto- 
rial design  to  test  five  factors  each  at  four  levels  requires  1,024  tests. 
However,  three  mutually  orthogonal,  four-level  I^tin  squares  may  be  used  to 
reduce  the  number  of  tests  to  16.  The  following  three  squares 

0123  0123  0123 


10  3 2 


2 3 0 1 


2301  3210 

3210  1032 

may  be  superimposed  to  give 


0,0,0 

1,2,3 

2.3.1 

3.1.2 


1,1,1 

0,3,2 

3,2,0 

2,0,3 


2,2,2 

3,0,1 

0,1,3 

1,3,0 


3 2 10 
10  3 2 
2 3 0 1 


(213) 


3,3,3 

2,1,0 

1,0,2 

0,2,1 


(214) 


This  array 

is  called 

a hyper  Greco-Latin 

square  and  represents  the  following 

16  tests: 

(1) 

bi  S *1  S • 

MM 

b3C3d3e3 

MMs 

MM 

MC 3 ^ 

MM  <215> 

as  C2  d3  ^ 

MM 

MM 

a2h3°l  S 

SMS 

a3blC2e3 

MM 

a3b3d2  ^ 

These 

tests  can 

be  made  to  estimate 

the  main  effects 

of  each  of  the  five 

factors  over  the  range  of  four  levels  for  each  factor.  The  data  obtained  can 
serve  to  identify  the  factor  or  factors  that  require  more  detailed  investiga- 
tion if  the  process  variability  is  unacceptable. 

Several  examples  of  Latin  squares  containing  up  to  12  levels  per  factor 
and  sets  of  orthogonal  Latin  squares  containing  up  to  9 levels  per  factor  are 
given  in  a book  by  Fisher  and  Yates.13  Examples  also  are  given  in  books  by 
Davies14  and  by  Cochran  and  Cox.15 

Randomised  Designs 

The  statistical  validity  of  results  obtained  from  any  experiment  depends 
upon  the  tests  being  made  in  a random  order.  If  the  tests  are  not  made  in 
random  order,  a trend  may  appear  in  the  results  as  a consequence  of  the  order 
in  which  the  tests  are  made.  Such  a trend  due  to  variation  in  an  unobserved 
factor  may  contribute  to  the  estimates  of  some  of  the  important  factor  effects 
for  example,  wear  of  equipment  can  produce  a time  trend  in  results  obtained. 

Since  uniform  test  conditions  cannot  always  be  guaranteed,  the  tests  must 
be  made  in  a random  order  to  minimize  the  effects  of  extraneous  factors. 

There  are  several  ways  to  randomize  the  order  in  which  tests  are  made. 

One  way  is  simply  to  randomize  all  the  tests  in  an  experiment;  for  example, 
suppose  a design  requires  16  different  tests,  and  to  evaluate  experimental 
error,  three  full  replicates  of  the  design  are  to  be  included  in  the  experi- 
ment. The  numbers  from  1 to  48  may  be  assigned  to  the  tests  and  then  random- 
ized to  give  the  order  in  which  to  make  the  tests. 

The  order  in  which  to  make  a series  of  tests  is  random  if,  and  only  if, 
it  is  obtained  by  a random  process  such  as  tossing  coins,  drawing  numbered 
cards  out  of  a bowl,  and  the  like.  Such  processes  may  be  time  consuming,  so 
tables  of  random  digits  have  been  prepared  for  use  in  randomizing  the  order 
of  a series  of  tests.  When  using  such  a table,  a random  method  is  employed 


13Fisher,  R.  A.,  and  Yates,  F. , Statistical  Tables  for  Biological,  Agricultu- 
ral, and  Medical  Research:  Hafner  Publishing  Co.,  Inc.,  New  York,  N.Y. , 
5th  ed. , 1957,  pp.  80-83. 

14Davies,  0.  L. , Design  and  Analysis  of  Industrial  Experiments:  Hafner  Pub- 
lishing Co.,  New  York,  N.Y. , 1956,  pp.  159-186. 

16Cochran,  W.  C. , and  Cox,  G.  M. , Experimental  Design:  John  Wiley  & Sons, 
Inc.,  New  York,  N.Y. , 1950,  pp.  86-121. 


1 


p mm*! 


89 


for  selecting  a row  and  a column  in  which  to  find  the  first  digit.  A toss  of  a coin 
then  may  be  used  for  deciding  whether  to  read  to  the  right,  left,  up,  or  down  from 
this  point  to  obtain  a random  sequence  for  the  test  numbers  in  the  experiment. 

The  use  of  a table  of  random  numbers  may  be  illustrated  as  follows:  In  most 
cases,  a metallurgical  experiment  will  consist  of  somewhat  less  than  100  tests,  but 
often  more  than  10  tests.  Therefore,  table  8 was  prepared  to  contain  25  rows  and  25 
columns  of  the  two-digit  numbers  from  00  to  99  arranged  in  a random  order. 


TABLE  8.  - Random  numbers 


01 

02 

03 

04 

05 

06 

07 

08 

09 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

01 

86 

50 

87 

87 

65 

87 

13 

51 

84 

13 

20 

31 

76 

12 

19 

85 

06 

00 

70 

27 

20 

68 

86 

38 

51 

02 

83 

27 

88 

06 

02 

09 

09 

21 

32 

14 

37 

44 

46 

18 

50 

50 

54 

33 

58 

45 

63 

64 

65 

12 

19 

03 

31 

27 

62 

39 

20 

21 

31 

36 

27 

23 

41 

44 

51 

52 

25 

61 

30 

66 

66 

67 

68 

71 

10 

71 

71 

04 

75 

79 

32 

81 

83 

86 

54 

86 

87 

88 

72 

89 

86 

54 

69 

89 

09 

49 

51 

43 

90 

06 

17 

16 

49 

05 

15 

75 

76 

81 

87 

40 

88 

54 

90 

13 

68 

83 

54 

73 

13 

90 

91 

60 

14 

46 

68 

91 

17 

22 

73 

06 

29 

96 

95 

35 

01 

91 

91 

24 

94 

80 

37 

29 

90 

01 

59 

53 

79 

24 

16 

47 

05 

26 

55 

59 

38 

07 

80 

30 

33 

03 

20 

28 

47 

02 

04 

15 

30 

40 

49 

12 

17 

70 

74 

85 

48 

06 

29 

60 

52 

63 

26 

08 

64 

69 

77 

82 

83 

98 

60 

99 

66 

10 

17 

08 

25 

30 

99 

99 

85 

54 

61 

72 

16 

03 

14 

10 

37 

09 

34 

45 

85 

39 

94 

43 

39 

95 

11 

97 

98 

18 

43 

97 

97 

87 

93 

67 

56 

81 

21 

71 

95 

08 

43 

10 

90 

96 

96 

96 

10 

94 

96 

21 

69 

24 

77 

00 

09 

20 

05 

57 

05 

61 

93 

49 

19 

45 

82 

93 

95 

11 

23 

38 

00 

29 

74 

11 

57 

30 

13 

44 

47 

10 

49 

36 

52 

55 

57 

11 

55 

64 

73 

03 

56 

15 

24 

12 

65 

65 

28 

67 

72 

22 

50 

33 

38 

73 

12 

74 

76 

78 

38 

78 

23 

73 

76 

60 

70 

72 

82 

28 

47 

13 

48 

13 

25 

56 

70 

76 

05 

28 

42 

68 

83 

19 

15 

26 

32 

12 

06 

40 

44 

51 

46 

70 

80 

84 

03 

14 

06 

64 

02 

07 

15 

17 

08 

27 

28 

34 

41 

01 

42 

44 

58 

60 

42 

55 

58 

67 

72 

16 

47 

77 

10 

15 

24 

27 

35 

35 

11 

35 

22 

43 

00 

10 

07 

31 

48 

63 

50 

65 

69 

62 

74 

78 

56 

38 

71 

52 

84 

16 

66 

25 

91 

52 

63 

74 

52 

79 

80 

97 

77 

72 

78 

99 

08 

79 

46 

61 

34 

91 

79 

29 

14 

69 

56 

17 

86 

52 

36 

80 

30 

73 

64 

88 

53 

24 

80 

39 

86 

14 

22 

16 

36 

68 

48 

90 

92 

32 

18 

59 

55 

18 

20 

78 

75 

75 

94 

65 

34 

95 

09 

03 

10 

39 

40 

94 

25 

59 

22 

19 

01 

29 

95 

96 

41 

62 

34 

19 

02 

16 

53 

75 

82 

11 

92 

51 

53 

82 

85 

69 

56 

00 

32 

35 

37 

44 

48 

50 

57 

33 

39 

59 

43 

20 

08 

25 

42 

23 

04 

13 

21 

34 

18 

07 

33 

06 

45 

60 

67 

23 

42 

67 

70 

75 

83 

88 

31 

41 

57 

21 

91 

21 

92 

16 

01 

04 

92 

89 

26 

77 

81 

92 

07 

14 

18 

31 

37 

62 

63 

03 

42 

55 

66 

66 

09 

22 

12 

74 

48 

48 

11 

09 

46 

58 

61 

46 

09 

53 

76 

78 

81 

82 

89 

93 

10 

57 

05 

33 

64 

93 

62 

23 

92 

89 

98 

19 

97 

39 

85 

62 

88 

98 

99 

93 

23 

94 

98 

04 

08 

28 

59 

63 

32 

45 

79 

84 

84 

24 

02 

36 

04 

58 

17 

21 

35 

40 

40 

01 

36 

41 

53 

07 

15 

26 

15 

05 

61 

71 

37 

26 

81 

00 

19 

25 

41 

45 

02 

77 

84 

12 

30 

81 

24 

18 

04 

75 

58 

22 

89 

07 

02 

12 

47 

95 

96 

97 

89 

98 

49 

Now  suppose  16  tests  are  to  be  made.  One  convenient  way  to  select  the  test 
number  is  to  take  any  textbook  that  may  be  handy  and  open  it  to  a random  page.  The 
even-numbered  page  will  always  be  on  the  left-hand  side  of  an  open  book,  so  the  toss 
of  a coin  may  decide  whether  to  read  the  even  or  odd  number.  Let  the  page  number  so 
selected  be  186.  A multiple  of  25  (in  this  case  175)  can  be  subtracted  from  186  to 
obtain  11.  This  will  be  the  row  number.  The  book  may  be  closed  and  opened  again  to 
obtain  the  column  number,  say  18.  Now  further  tossing  of  the  coin  may  determine 
that  the  column  should  be  read  down  to  obtain  the  sequence 

11,  73,  40,  55,  62,  61,  68,  19,  44,  67,  etc.  (216) 


1 


90 


For  convenience,  since  only  16  tests  are  involved,  subtract  a multiple  of  20 
from  each  number  equal  to  or  greater  than  20.  This  gives  the  sequence 

11,  13,  00,  15,  2,  1,  8,  19,  4,  7,  etc.  (217) 

The  numbers  00,  17,  18,  and  19  will  be  ignored.  When  the  bottom  of  the  column 
is  reached,  continue  at  the  top  of  column  19  and  read  down  again.  As  soon  as 
a number  from  1 to  16  has  been  obtained,  it  will  be  ignored  when  it  appears 
later  in  the  sequence.  This  procedure  is  continued  until  the  following 
sequence  is  obtained: 

11,  13,  15,  2,  1,  8,  4,  7,  5,  12,  10,  6,  14,  16,  3,  9.  (218) 

The  tests  are  numbered  before  the  sequence  is  obtained  and  then  made  in  the 
order  indicated. 

When  designing  a fractional  factorial  experiment,  there  are  two  or  more 
design  blocks  of  tests  that  can  be  made.  The  blocks  should  be  numbered,  and 
one  of  the  blocks  then  should  be  selected  in  a random  way.  Perhaps  more  than 
one  system  of  confounding  will  yield  data  that  are  adequate  for  a particular 
experiment.  In  this  example,  the  number  of  blocks  from  which  one  block  may  be 
located  is  increased.  Latin  squares  are  a special  case  of  the  latter 
situation. 

With  more  than  three  levels  per  factor  in  a Latin  square,  the  number  of 
possible  squares  from  which  to  make  a random  selection  becomes  large.  For 
example,  there  are  576  different  four-level  Latin  square  designs  and  161,280 
five- level  Latin  square  designs.  The  following  procedure  can  be  used  to  make 
a random  selection  from  these  designs.  There  are  four  choices,  0,  1,  2,  or  3 
for  the  first  value  in  the  first  row  of  the  four-level  Latin  square.  Once 
this  choice  is  made,  there  are  three  choices  remaining  for  the  second  value. 
Then  there  are  two  choices  for  the  third  value,  and  the  fourth  value  is  deter- 
mined by  the  first  three  choices.  Now,  there  are  three  choices  for  the  first 
value  in  the  second  row,  and  there  may  be  two  or  three  choices  for  the  second 
va lue. 

This  procedure  is  continued  until,  finally,  all  values  in  the  last  row 
are  determined  by  the  values  previously  chosen.  Each  location  in  a Latin 
square  must  be  considered  carefully  because  sometimes  the  number  of  choices 
depends  upon  the  specific  values  which  have  been  obtained  previously.  The 
table  of  random  numbers  can  be  used  to  select  the  values  for  each  location. 
Since  there  are  only  four  values,  the  digits  may  be  read  singly  rather  than 
in  pairs.  First  let  4 = 0,  5 = 1,  6 = 2,  7 = 3,  and  ignore  digits  8 and  9. 
Then  based  on  a random  selection  begin  in  table  8,  row  14,  column  03  and  read 
to  the  right.  The  sequence  of  digits  beginning  0,2,  0,7,  selects  the  follow- 
ing values  for  the  first  row: 

0,  2,  3,  1.  (219) 

Continuing  in  this  way,  the  next  digits  in  the  sequence,  5,  1,  _7»  0 select 
the  values  for  the  second  row: 
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1,  3,  0,  2.  (220) 

The  digits  8,  2,  7,  2,  8,  3,  4 select  the  first  two  values,  and  the  third  row 
then  is 

2,  0,  1,  3.  (221) 

Finally,  the  fourth  row  must  be 


I 


a 


3,  1,  2,  0.  (222) 

The  digits  that  are  underlined  are  those  which  actually  determine  the  corre- 
sponding underlined  values  in  each  row. 

The  more  information  one  wishes  to  obtain  from  an  experiment,  the  less 
opportunity  there  may  be  to  randomize  the  experiment.  If  only  the  main 
effects  of  the  factors  are  important,  there  may  be  many  Latin  square  designs 
from  which  to  choose.  If  a particular  system  of  confounding  were  required, 
there  may  be  only  two  or  three  design  blocks  available.  When  a full  factorial 
design  is  necessary,  there  is  only  one  set  of  tests.  In  all  cases,  the  tests 
that  are  selected  or  determined  by  a design  should  be  made  in  a random  order. 

Randomized  Blocks 

An  ideal  experiment  would  be  one  in  which  the  factors  to  be  studied  could 
be  controlled  exactly  at  the  levels  specified  by  the  design  and  all  other  test 
conditions  could  be  maintained  at  constant  levels.  There  would  be  no  need  for 
randomization  in  research  conducted  under  such  conditions.  In  actual  practice, 
however,  this  is  rarely  possible. 

A set  of  tests  that  can  be  made  under  closely  controlled  conditions  is 
called  a block  of  tests.  A man  may  be  able  to  make  eight  tests  in  1 day. 

Then  those  tests  constitute  a block  of  eight  tests.  It  may  be  possible  to 
place  four  ore  samples  in  a furnace  at  one  time.  The  four  samples  treated 
during  a single  furnace  campaign  constitute  a block  of  four  tests.  If  eight 
samples  are  to  be  treated  during  two  different  campaigns,  the  eight  tests  con- 
stitute two  blocks  of  four  tests.  The  tests  within  any  block  of  tests  should 
be  made  in  a random  order. 

Randomizing  all  tests  in  an  experiment  is  not  always  sufficient.  Con- 
sider a design  consisting  of  five  tests  and  an  experiment  in  which  each  of  the 
design  tests  is  to  be  made  twice.  A random  order  in  which  to  make  the  10 
tests  might  be 

8,  6,  10,  5,  7,  2,  4,  3,  9,  1.  (223) 

Test  6 will  be  the  same  as  test  1,  7 the  same  as  2,  and  so  on.  The  order  of 
set  (223)  may  be  rewritten  as  follows: 


3,  1,  5,  5,  2;  2,  4,  3,  4,  1 


(224) 
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Now,  suppose  it  were  possible  to  make  only  five  tests  at  a time.  The 
primary  objective  of  the  experiment  is  to  obtain  comparisons  between  the  five 
different  tests.  The  purpose  of  duplicate  tests  is  to  obtain  an  estimate  of 
experimental  error  and  to  determine  which  of  the  comparisons  are  significant 
when  compared  with  experimental  error.  The  order  of  set  (224)  has  the  disad- 
vantage that  test  4 is  not  included  in  the  first  block  of  five  tests.  Also, 
test  5 is  not  included  in  the  second  block.  It  is  preferable  to  include  all 
five  design  tests  in  each  block.  An  example  of  an  order  in  which  the  five 
tests  are  randomly  arranged  within  each  block  is  the  following: 

2,  1,  5,  4,  3;  5,  2,  3,  1,  4.  (225) 

The  above  is  called  a randomized  block  design.  A design,  whether  full  facto- 
rial, fractional  factorial,  or  another  type,  requires  a specified  set  of  tests, 
and  this  set  is  a replicate  of  the  design.  The  requirement  of  a randomized 
block  design  is  that  each  block  should  contain  one  or  more  complete  replicates 
and  that  the  tests  within  each  block  should  be  made  in  a random  order. 

Incomplete  Randomized  Blocks 

It  is  desirable  to  make  all  comparisons  between  tests  at  different  factor 
levels  by  means  of  data  from  tests  within  a single  experiment  block.  This 
reduces  the  chance  for  variations  in  extraneous  factors  from  one  block  to 
another  to  influence  the  estimates  of  the  important  factor  effects.  However, 
it  is  not  always  possible  to  include  a complete  replicate  of  a design  in  one 
block  of  tests.  Such  a situation  is  considered  in  the  sections  on  confounding 
where  the  tests  of  a full  factorial  design  are  to  be  divided  among  two  or  more 
experiment  blocks  of  tests.  In  each  of  these  experiments  a fractional  facto- 
rial design  consists  of  one-half,  one-third,  one-fourth,  and  so  on,  of  the 
tests  in  the  corresponding  full-factorial  design. 

A situation  may  arise  in  which  it  is  desirable  to  include  some  other  frac- 
tion of  the  tests  in  a block.  Suppose  an  experiment  were  designed  to  investi- 
gate five  levels  of  a single  factor  and  for  practical  reasons  it  is  possible 
to  assure  uniform  conditions  within  a block  only  if  each  block  contains  a max- 
imum of  four  tests.  Comparisons  between  tests  at  all  five  levels  can  be 
obtained  from  two  blocks  of  tests.  One  block  can  consist  of  tests  aQ,  , 

and  a , and  the  second  block  can  consist  of  tests  aQ , , a2 , and  a4  . Compari- 

sons between  tests  a , Sj  , and  as  can  be  made  by  means  of  the  data  from  both 
blocks  of  tests.  Comparisons  between  test  a3  and  the  first  three  tests  can  be 
made  with  the  data  from  the  first  block,  and  comparisons  between  test  a4  and 
the  first  three  tests  can  be  made  with  the  data  from  the  second  block.  Test 
a can  be  compared  with  test  a4  indirectly  through  comparisons  with  the  first 
tnree  tests.  Comparisons  between  the  various  tests  are  made  by  studying  the 
differences  in  the  responses  obtained.  In  the  last  example,  the  responses  for 
tests  a3  and  are  each  compared  with  the  average  response  for  the  first 
three  tests  in  the  appropriate  block.  These  two  blocks  of  tests  constitute  an 
incomplete  randomized  block  design. 

If  all  comparisons  are  to  be  made  with  the  same  degree  of  precision,  it 
Is  necessary  to  use  a balanced  incomplete  randomized  block  design.  In  the  two 


blocks  given  above,  tests  aQ  and  a1  appear  together  in  both  blocks,  so  do 
tests  a0  and  as  and  also  tests  aj  and  as . Tests  a and  a3  appear  together 
only  in  the  first  block,  tests  aQ  and  a4  only  in  the  second  block,  and  so  on. 


Tests  a3  and  a*  do  not  appear  together  in  either  block.  For  an  incomplete 


randomized  block  design  to  be  balanced,  every  possible  pair  of  tests  should 
appear  together  in  a block  the  same  number  of  times  in  the  design.  A balanced 
design  for  the  experiment  to  test  five  levels  of  a factor  in  blocks  of  four 

tests  is  given  in  table  9.  Every  pair  of  tests  appears  in  a block  three  times 

in  this  design.  For  example,  tests  a0  and  a1  both  appear  in  blocks  1,  2,  and 

3;  tests  aQ  and  a3  both  appear  in  blocks  1,  2,  and  4,  and  so  on. 


TABLE  9.  - A balanced  incomplete  randomized  block  design  for 
an  experiment  consisting  of  five  tests  in  which 
four  tests  are  to  be  made  in  each  block 


Block 


The  advantage  of  a design  which  is  symmetrical  with  respect  to  a diagonal 
of  the  array  as  shown  in  table  9 is  that  the  same  statistical  procedures  can 
be  used  to  compare  the  effects  of  different  tests  corrected  for  differences 
between  blocks,  and  also  to  compare  the  effects  of  different  blocks  corrected 
for  differences  between  the  tests.  However,  not  all  balanced  incomplete  ran- 
domized block  designs  are  symmetrical. 


The  number  of  blocks  in  a balanced  design  may  be  larger  than  in  the  exam- 
ple given.  Suppose  a complete  replicate  requires  eight  tests  and  it  is  possi- 
ble to  make  only  six  tests  in  each  block.  There  are  28  blocks  in  a balanced 
design  for  this  experiment.  This  number  is  found  by  determining  how  many  dif- 
ferent combinations  of  six  tests  can  be  obtained  from  the  eight  tests  in  the 
replicate.  The  following  combinatorial  formula  is  used: 


n(n  - 11 


n(n  - 1) 


rl (n  - r)I 


(n  - r) I 


(226) 


The  letter  n represents  the  number  of  tests  in  the  complete  replicate,  and  r 
represents  the  number  of  tests  in  a block.  The  symbol  ni  represents  the 
product  of  all  Integers  from  1 to  n and  equals  n(n  - l)(n  - 2). . .(3)(2)(1). 
The  second  form  of  equation  (226)  is  used  if  r is  less  than  (n  - r);  the 
third  form  is  used  if  (n  - r)  is  less  than  r. 


Youden  squares  and  lattice  squares,  which  also  are  discussed  in  the  liter- 
ature, are  special  cases  of  Incomplete  randomized  block  designs.  Additional 
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information  on  incomplete  randomized  block  designs  can  be  found  in  the  books 
by  Fisher  and  Yates,16  Davies,17  and  Cochran  and  Cox.18 

Composite  Designs 

The  degree  of  the  polynomial  components  of  the  main  effects  which  can  be 
estimated  by  using  the  factorial  designs  thus  far  discussed  is  limited  by  the 
number  of  levels  associated  with  each  factor  in  the  experiment.  For  certain 
problems  this  limitation  can  be  overcome  by  the  use  of  composite  designs. 

These  avoid  the  considerable  extra  labor  involved  in  the  use  of  a regular  fac- 
torial experiment  with  more  factor  levels;  Composite  designs  are  formed  by 
adding  extra  tests  at  proper  factor  levels  to  a factorial  design  too  small  to 
estimate  the  effects  of  interest. 

Much  work  has  been  done  in  the  development  of  composite  designs  for  the 
estimation  of  all  second-degree  terms.19 "33  The  estimation  of  the  correspond- 
ing second  order  coefficients  is  the  major  problem.  The  procedure  for  making 
composite  designs  for  the  estimation  of  second-order  coefficients  is  to  first 
choose  a two-level,  full-factorial  or  fractional-factorial  design  that  will 
provide  an  estimate  of  all  first  order  coefficients  and  such  interaction  coef- 
ficients of  second  order  as  are  presumed  to  exist.  The  design  so  chosen  then 
is  supplemented  by  additional  tests  that  will  make  it  possible  to  estimate  the 
second  order  quadratic  coefficients. 

The  procedure  will  be  illustrated  on  the  basis  of  the  two-level,  three- 
factor  full-factorial  copper  leaching  experiment  previously  discussed  in  the 
section  on  two- level  factorial  designs.  The  response  for  this  experiment  was 
recovery  of  copper,  and  the  factors  were  percent  solids,  Xa,  at  25  and  35  per- 
cent solids;  leaching  time,  Xb,  at  30  and  40  minutes;  and  temperature,  Xc,  at 
25°  and  35°  C.  The  design  units  for  these  factors  were  10  percent  solids,  10 
minutes,  and  10°  C. , respectively.  The  base  levels  were  30  percent  solids,  35 
minutes'  leaching  time,  and  30°  C.  leaching  temperature.  In  terms  of  design 
units  the  diagram  for  this  experiment  was  shown  in  figure  8. 


16Fisher,  R.  A.,  and  Yates,  F.,  Statistical  Tables  for  Biological,  Agricultural, 
and  Medical  Research:  Hafner  Publishing  Co.,  Inc.,  New  York,  N.Y. , 5th 
ed. , 1957,  pp.  84-85. 

l7Davies,  0.  L. , Design  and  Analysis  of  Industrial  Experiments:  Hafner  Pub- 
lishing Co.,  Inc.,  New  York,  N.Y. , 1956,  pp.  199-246. 

18Cochran,  W.  C. , and  Cox,  G.  M. , Experimental  Design:  John  Wiley  & Sons, 

Inc.,  New  York,  N.Y. , 1950,  pp.  259-390. 
l9Box,  G.  E.  P. , and  Wilson,  K.  B. , On  the  Experimental  Attainment  of  Optimum 
Conditions:  Jour.  Royal  Statistical  Soc. , series  B (Methodological),  vol. 
13,  No.  1,  1951,  pp.  1-45. 

30  Box,  G.  E.  P. , The  Exploration  and  Exploitation  of  Response  Surfaces: 

Biometrics,  March  1954,  pp.  16-60. 

31  Davies,  0.  L. , The  Design  and  Analysis  of  Industrial  Experiments:  Oliver 

and  Boyd,  London,  1954,  pp.  532-551. 

33Andersen,  S.  L. , Statistics  in  the  Strategy  of  Chemical  Experimentation: 

Chem.  Eng.  Prog.,  vol.  55,  No.  4,  April  1959,  pp.  61-67. 
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The  coefficients  that  could  be  obtained  from  the  design  of  figure  8 are 

I,  A,  B,  C,  AB,  AC,  BC,  and  ABC.  (227) 

If  it  were  assumed  that  only  second-order  coefficients  were  important,  the 
complete  group  of  coefficients  that  would  be  desired  are 

I,  A,  A2,  B,  B2,  C,  C2,  AB,  AC,  and  BC.  (228) 

There  are  various 
ways  in  which  the 
basic  design  of  fig- 
ure 8 may  be  supple- 
mented with  addi- 
tional tests  to 
estimate  the  quad- 
ratic effects  and 
obtain  the  coeffi- 
cients A2  , B2  , and 
& . One  possible 
arrangement  is  to 
make  a test  at  the 
center  of  the  basic 
design  and  two  tests 
along  each  of  the 
coordinate  axes  of 
the  design  spaced  at 
equal  intervals  in 
opposing  directions 
from  the  center. 

This  makes  a total 
of  seven  additional 
tests.  This  arrange- 
ment of  tests  is 
shown  in  figure  9. 

The  factor 
levels  in  terms  of 
design  units  for  the 
eight  tests  of  the 

two-level  factorial  design  in  figure  9 are  shown  in  the  first  eight  lines  of 
table  10.  The  additional  seven  tests  for  the  composite  design  are  shown  in 
figure  9 by  the  numbers  9,  10,  . . .,  15.  If  the  letter  d with  an  appropriate 
subscript  represents  the  axial  distances  of  the  tests  from  the  center  of  the 
design,  the  factor  levels  then  will  be  shown  in  the  latter  half  of  table  10. 

The  values  to  be  given  da,  dj,,  and  dc  will  be  discussed  later. 

The  coefficients  in  set  (228)  can  be  determined  from  data  obtained  from 
the  15  tests  listed  in  table  10  and,  therefore,  in  12  less  tests  than  a three- 
level,  full-factorial  experiment  would  require.  However,  if  one  or  more  of 
the  interaction  coefficients,  AB,  AC,  or  BC,  can  be  considered  negligible,  the 


u 
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FIGURE  9.  - A Composite  Design  Based  Upon  a 2 x 2 x 2 
Factorial  Experiment. 
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remaining  coefficients  of  set  (228)  could  be  obtained  by  9 tests  of  a one-third 
replicate  of  a three- level  design  instead  of  the  15  tests  required  by  the  com- 
posite design.  The  cases  in  which  the  one-third  replicate  of  a three-factor 
design  could  be  used  in  preference  to  a composite  design  now  will  be  discussed. 


TABLE  10.  - Factor  levels  in  terms  of  design  units  for  a 
three-factor  composite  design  for  the  esti- 
mation of  all  second-degree  effects 


Test 

Factor  levels  in  design  units 

xa 

Xb 

Xc 

(1) 

-1/2 

-1/2 

-1/2 

a 

+1/2 

-1/2 

-1/2 

b 

-1/2 

+1/2 

-1/2 

ab 

+1/2 

+1/2 

-1/2 

c 

-1/2 

-1/2 

+1/2 

ac 

+1/2 

-1/2 

+1/2 

be 

-1/2 

+1/2 

+1/2 

abc 

+1/2 

+1/2 

+1/2 

9 

-da 

0 

0 

10 

da 

0 

0 

11 

0 

~db 

0 

12 

0 

+db 

0 

13 

0 

0 

"dc 

14 

0 

0 

+dc 

15 

0 

0 

co 

The  importance  of  second-order  coefficients  and  the  unimportance  of  one 
of  the  second-order  interaction  coefficients  in  the  region  of  interest  may  be 
known  without  having  made  a two- level  experiment  in  the  region.  In  such  cases 
a one-third  replicate  of  a three-level  design  is  appropriate.  If  the  informa- 
tion about  the  importance  of  the  second  order  coefficients  were  determined 
from  a two-level  design  in  the  region  of  interest  and  if  it  were  practical  to 
reduce  the  design  units  by  one-half,  three  of  the  tests  from  the  two- level 
design  plus  six  additional  tests  could  be  used  to  form  a one-third  replicate 
design.  Hiis  also  may  be  done  if  it  is  permissible  to  double  the  range  of  the 
experiment  in  one  direction  for  one  or  more  of  the  factors  and  to  reduce  the 
design  units  by  one-half  for  the  remaining  factors.  In  each  of  these  situa- 
tions involving  a three-factor  problem,  the  total  number  of  tests  necessary  to 
obtain  the  information  is  less  than  that  required  by  a composite  design. 

For  experiments  with  more  than  three  factors,  a composite  design  for  the 
estimation  of  second-order  coefficients  requires  fewer  tests  than  a one-third 
replicate  of  a three- level  design.  For  two-factor  experiments  the  number  of 
tests  in  a composite  design  for  the  estimation  of  all  second  order  coeffi- 
cients is  equal  to  the  number  of  tests  in  a three-level  full  factorial.  A 
useful  one-third  replicate  of  a two-factor,  three- level  design  is  not  possible. 

Composite  designs  may  be  constructed  for  experiments  in  k factors  in  a 
manner  similar  to  the  example  previously  given  for  three  factors.  An 


appropriate  two-level  factorial  or  fractional-factorial  design  is  chosen,  and 
2k  + 1 supplementary  points  are  added  to  the  design  at  the  center  and  along 
each  design  axis  in  a similar  fashion.  Composite  designs  provide  the  advan- 
tage of  proceeding  with  the  experimental  work  in  natural  stages.  The  facto- 
rial or  fractional-factorial  experiment  can  be  completed  first.  If  the  first 
order  coefficients  are  relatively  large  compared  to  the  interaction  coeffi- 
cients of  second  order,  the  research  worker  can  proceed  with  the  next  stage 
of  his  work.  However,  if  the  interaction  coefficients  of  second  order  are 
relatively  large  (not  necessarily  larger),  compared  to  the  first-order  coeffi- 
cients, the  research  worker  then  can  add  the  necessary  supplementary  points  to 
determine  the  quadratic  coefficients.  This  advantage  is  especially  helpful  in 
the  determination  of  optimum  conditions  by  the  steepest  ascent  method. 

It  must  be  pointed  out  that  in  composite  designs  for  the  estimation  of 
coefficients  as  high  as  the  second  order  some  of  the  coefficients  that  it  is 
desired  to  estimate  will  be  biased  by  higher  order  aliases  if  some  of  these 
aliases  are  not  negligible.  Third-order  coefficients  are  the  most  likely  to 
be  troublesome,  and  if  some  of  them  are  not  negligible,  the  extent  of  the  bias 
will  vary  with  the  type  of  the  composite  design.  The  choice  between  a compos- 
ite design  and  a higher  level  factorial  must  be  based  upon  an  evaluation  of 
the  importance  of  conserving  experimental  effort  and  the  consequences  of  being 
wrong  if  higher  order  coefficients  are  actually  more  important  than  previously 
estimated  or  stipulated. 

Orthogonal  Composite  Designs 

Orthogonality  previously  has  been  set  forth  as  a desirable  characteristic 
of  experiment  designs,  because  such  designs  provide  effect  estimates  which  are 
uncorrelated  with  each  other  in  that  the  variations  of  the  estimates  of  the 
individual  effects  within  the  experimental  region  are  not  artificially  related 
due  to  the  manner  of  design.  Composite  designs  may  be  arranged  so  that  orthog- 
onality of  the  estimated  effects  is  thereby  insured  and  such  designs  are 
called  orthogonal  composite  designs.  Orthogonal  composite  designs  for  the 
estimation  of  second-order  coefficients  also  provide  a minimum  amount  of  possi- 
ble bias  from  third-order  coefficients  as  compared  to  the  effects  of  bias  in 
composite  designs  \rfiich  are  not  orthogonal. 

Orthogonality  in  composite  designs  for  the  estimation  of  quadratic  coeffi- 
cients is  obtained  by  the  proper  spacing  of  the  extra  points  on  each  axis  of 
the  design.  The  composite  design  shown  in  figure  9 can  be  made  orthogonal  by 
making  tests  9,  10,  11,  12,  13,  and  14  at  the  same  distance  from  the  center  of 
the  design  in  terms  of  design  units;  that  is,  the  factor  levels  for  these 
tests  as  shown  in  table  10  must  be  such  that  da  =>  dfc  = dc  = d,  when  d is  a 
properly  chosen  constant.  For  two-level  designs  in  which  the  design  unit  is 
one-half  the  distance  between  the  two  levels  of  a factor,  Box  and  Wilson83 
have  listed  the  proper  values  of  d for  orthogonality.  For  two-,  three-,  and 
four-factor  full  factorials  and  for  a five-factor,  half-replicate  fractional 
factorial  with  the  defining  contrasts  I and  ABCDE,  the  proper  corresponding 
values  of  d are  listed  as  1.000,  1.215,  1.414,  and  1.547,  respectively. 


23 Work  cited  in  footnote  19,  p.  37. 
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In  this  paper  a design  unit  is  the  distance  between  two  adjacent  levels  of  a 
factor,  and  for  such  a case  the  corresponding  values  of  d are  one-half  those 
given  above;  that  is,  0.500,  0.608,  0.707,  and  0.774.  For  example,  the  values 
of  da,  dj,,  and  dc  in  the  three- factor  composite  design  of  table  10  should  be 

da  ■ dj,  * dc  = 0.608  if  orthogonality  is  desired. 

Nonorthogonal  Composite  Designs 

Orthogonal  composite  designs  that  provide  estimates  of  all  second-order 
coefficients  do  not  estimate  all  of  the  second-order  coefficients  with  the 
same  precision;  that  is,  the  errors  associated  with  the  estimates  are  not 
equal.  Specifically  the  errors  in  the  quadratic  coefficients  and  those  in 
the  second-order  interaction  coefficients  are  respectively  equal,  but  the 
error  in  the  quadratic  coefficients  is  larger  than  the  error  in  the  second 
order  interaction  coefficients.  In  some  research  problems  it  is  desirable 
that  all  second-order  coefficients  be  estimated  as  nearly  as  possible  with 
equal  precision,  and  this  only  can  be  accomplished  at  the  expense  of  not 
having  orthogonality  in  design.  It  also  is  sometimes  desirable  to  use  compos- 
ite designs  in  which  the  extra  tests  are  not  added  to  the  basic  factorial 

design  in  the  symmetrical  fashion  shown  in  figure  9.  Composite  designs  that 
meet  the  requirements  of  these  two  general  situations  are  called  nonorthogonal 
composite  designs.  Problems  such  as  these  can  arise  in  the  determination  of 
optimum  conditions  by  the  path  of  steepest  ascent. 

Orthogonal  composite  designs  are  symmetrical  with  respect  to  the  center 
of  the  design.  Nonorthogonal  designs  may  be  either  symmetrical  or  nonsymme- 
trical.  In  nonorthogonal  symmetrical  composite  designs  for  the  estimation  of 
quadratic  coefficients,  one  or  more  of  the  pairs  of  tests  on  the  factor  axes 
are  placed  at  factor  levels  that  are  farther  from  the  center  of  the  design 
than  necessary  for  obtaining  orthogonality.  However,  when  it  is  desired  to 
estimate  all  of  the  second-order  coefficients  with  equal  errors,  it  also  is 
necessary  to  make  all  pairs  of  tests  on  the  factor  axes  at  specific  factor 
levels  that  are  placed  at  equal  distances  from  the  center  in  terms  of  design 
units.  With  regard  to  the  three-factor  design  shown  in  table  10,  it  is  neces- 
sary that  the  factor  levels  of  tests  9 to  14,  inclusive,  should  be  such  that 
da  * db  = d<.  = d,  where  d is  a certain  value  for  three-factor  designs.  For 
composite  designs  in  which  the  basic  two-level  factorial  design  has  units  that 
are  one-half  the  distances  between  levels  of  the  factors.  Box  and  Wilson24 
have  listed  the  proper  values  of  d necessary  to  obtain  equality  of  errors  for 
all  second-order  coefficients.  For  two-,  three-,  and  four-factor  full  facto- 
rials and  for  a five-factor  half-replicate  fractional  factorial  with  the 
defining  contrasts  I and  ABCDG,  the  proper  corresponding  values  of  d are 
listed  as  2.090,  2.432,  2.799,  and  2.872,  respectively.  For  designs  in  trfiich 
the  units  are  equal  to  the  distances  between  levels  of  the  basic  factorial  the 
values  of  d are  one-half  those  just  given;  that  is,  1.045,  1.216,  1.400,  and 
1.436,  respectively;  for  example,  the  values  of  da,  dj,,  and  dc  in  the  three- 
factor  composite  design  of  table  10  should  be  da  - dj,  ■ dc  ■ 1.216  if  equality 
of  errors  for  all  second  order  coefficients  is  desired. 


24 Work  cited  in  footnote  19,  p.  37. 
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Inspection  of  the  values  of  d given  by  Box  and  Wilson26  for  equality  of 
errors  in  estimates  of  all  second-order  coefficients  shows  that  the  values  of 
d increase  as  the  number  of  factors  increases.  The  values  of  d range  from 
2.090  for  a two  factor  full  factorial  to  2.872  for  a five-factor  half- 
replicate  of  a full  factorial.  The  absence  of  orthogonality  results  in  cor- 
relation between  the  estimates  of  the  quadratic  coefficients,  and  the  amount 
of  such  correlation  increases  with  increasing  values  of  d.  The  amount  of  pos- 
sible bias  from  third-order  coefficients  also  increases.  Davies26  cautions 
against  using  values  of  d as  large  as  3 because  the  magnitude  of  possible  bias 
from  third-order  coefficients  becomes  too  serious.  For  design  units  equal  to 
the  distance  between  levels  of  the  factors,  the  limiting  value  which  may 
cause  trouble  would  be  1.5. 


Nested  Designs 

Nested  designs  generally  are  used  when  successive  sampling  is  necessary 
to  determine  by  subsequent  testing  the  variation  in  some  associated  character- 
istic of  the  material  or  condition  under  study.  Estimates  of  the  variation 
introduced  by  the  successive  stages  of  the  sampling  procedure  usually  are 
desired.  Nested  designs  may  be  either  balanced  or  unbalanced.  When  the  same 
number  of  replications  occur  within  each  replication  of  the  preceding  stage, 
the  design  is  said  to  be  balanced.  An  example  of  a balanced  nested  design  is 
shown  in  figure  7.  Balanced  designs  may  require  a prohibitive  number  of  tests 
to  obtain  an  accurate  estimate  of  the  desired  variation  in  the  first  stage  of 
the  sampling.  Unbalanced  designs  are  often  more  efficient  for  this  purpose, 
and  mathematical  methods  are  available  for  assessing  the  relative  efficiency 
of  possible  arrangements  in  design.27 

Sampling  schemes,  however  simple,  are  experiments  or  fractions  thereof 
with  nested  designs.  All  statistical  methods  are  based  upon  the  proper  manipu- 
lation of  data  that  have  been  obtained  by  some  sampling  procedure.  It  is, 
therefore,  important  in  design  to  take  due  note  of  those  aspects  of  sampling 
that  have  a direct  bearing  upon  any  experiment.  In  some  cases  the  necessary 
sampling  procedures  do  not  enter  into  the  design  of  the  experiment  but  are  a 
mode  of  operation  necessary  to  perform  the  experiment  properly.  However, 
recognition  of  sampling  problems  may  result  in  a design  that  includes  factors 
due  to  sampling  even  though  such  factors  are  not  of  primary  concern.  In  such 
a case  the  design  may  have  a model  that  is  both  crossed  and  nested  in  classi- 
fication. Such  a design  is  shown  in  figure  10. 

Designs  such  as  the  one  in  figure  10  are  sometimes  used  when  it  is  neces- 
sary to  introduce  sampling  procedures  into  a factorial  experiment  in  which  the 
influence  of  two  or  more  factors  of  a treatment  must  be  introduced  in  succes- 
sive stages.  Figure  10  illustrates  a hypothetical  design  that  might  be  used 
in  the  preliminary  investigation  of  alternative  methods  for  the  leach  treat- 
ment of  large  tonnages  of  ore  having  considerable  variation  in  characteristics. 

2BWork  cited  in  footnote  19,  p.  37. 

20Work  cited  in  footnote  21.' 

27 Davies,  0.  L. , Statistical  Methods  in  Research  and  Production:  Hafner  Pub- 
lishing Co.,  New  York,  N.Y. , 1957,  pp.  115-117. 
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There  are  two  blends  of  ore,  and  each  blend  was  considered  to  be  nonhomogene- 
ous  and  hence  was  randomly  sampled  for  three  blend  lots  to  investigate  the 
effect  of  nonhomogeneity.  Each  blend  lot  was  randomly  sampled  to  obtain  three 
leach  lots  which  are  indicated  by  the  three  small  circles  in  each  row  1.  The 
leach  lots  were  subjected  to  one  of  the  three  leach  treatments  corresponding 
to  the  column  in  which  they  are  located.  After  the  leach  treatments  each  of 
the  leach  residues  was  randomly  sampled  for  two  leach  residue  samples,  which 
are  indicated  by  the  small  circles  in  each  row  2.  Duplicate  samples  for  chem- 
ical analysis  were  removed  from  each  of  the  leach  residue  samples,  and  these 
are  shown  by  the  small  circles  in  each  row  3. 

An  inspection  of  the  design  in  figure  10  will  make  clear  that  the  design 
relationships  are  as  follows:  The  analytical  test  samples  are  nested  in  the 
leach  residue  samples;  the  leach  residue  samples  are  nested  in  the  leach  resi- 
dues from  the  leach  lots;  the  leach  lots  are  nested  in  the  blend  lots;  and  the 
blend  lots  are  nested  in  the  blends.  However,  the  variation  between  leach 
lots  from  a particular  blend  lot  is  confounded  with  the  variation  between 
leach  treatments  in  such  a manner  that  their  combined  variation  is  the  varia- 
tion between  the  leach  residues.  Since  each  blend  lot  is  considered  to  be 
well  mixed,  the  variation  between  the  leach  lots  from  a particular  blend  lot 
is  considered  to  be  insignificant  compared  to  the  variation  between  leach 
treatments.  Therefore,  the  variation  between  the  leach  residues  from  a par- 
ticular blend  lot  is  considered  to  be  due  solely  to  the  effect  of  leach  treat- 
ments. The  leach  residue  samples  are  nested  within  leach  residues  specified 
by  a particular  leach  treatment,  a particular  blend  lot,  and  a particular 
blend.  As  a result  of  these  conditions,  the  nesting  of  the  leach  lots  in  the 
blend  lots  is  ignored  for  the  purposes  of  design  and  analysis,  and  the  leach 
residues  are  not  considered  to  be  nested  in  the  blend  lots.  The  blend  lots 
are  cross  classified  with  the  leach  treatments  for  each  separate  blend  of  ore. 
The  blends  of  ore  are  cross  classified  with  the  leach  treatments. 

The  model  for  this  crossed  and  nested  design  may  be  developed  after  the 
factors,  levels,  and  replicates  are  suitably  designated  symbolically.  Let 
the  factor  symbols  Xa,  Xj,,  Xc,  and  Xj,  respectively,  represent  blends  of  ore, 
blend  lots,  leach  treatments,  and  leach  residue  samples.  Let  the  letters  i, 
j,  k,  and  m represent  levels  corresponding  to  the  factors.  Let  the  letter  n 
represent  a numbered  replicate  of  the  test  samples.  The  model  for  the  design 
then  is 


yijkmn  = p.  + *At  + *2j(i)  + *£k  + *£m(ijk) 

+ *1^  + *Ijfc(i)  + *en(ijkm)*  (229) 

A specific  chemical  analysis  may  be  subdivided  into  parts  on  the  basis  of  the 
model  (229).  For  example,  consider  the  chemical  analysis,  y12  a,  obtained 
for  blend  1,  blend  lot  2,  leach  treatment  1,  leach  residue  sample  2,  and  test 
sample  2.  The  chemical  analysis  may  be  written  as 

= H + + *&(!...)  + *Sl  + *.29(181  .) 

+ **i  .i  . + **.ai  .(i  . . . ) + ^Ciaia)’ 


^1  91  98 
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In  this  equation  the  dots  in  the  subscripts  of  some  of  the  terms  are  essential 
in  order  to  precisely  locate  the  individual  numbers  in  the  subscript  with 
respect  to  the  succession  of  letters  and  numbers,  ijkm  = 1212.  It  should  be 
noted  that  the  cross  classification  of  blend  lots  with  leach  treatments  for 
each  separate  blend  of  ore  causes  the  interaction  effect  between  blend  lots 
and  leach  treatments  to  be  nested  within  separate  blends  of  ore. 

The  Importance  of  Experimental  Error  in  Design  Considerations 

In  the  design  of  an  experiment  all  available  information  about  the  nature 
of  the  experimental  errors  that  may  be  encountered  should  be  considered  to 
assure  obtaining  data  that  vrfien  analyzed  will  produce  the  maximum  useful  infor- 
mation for  the  work  performed.  Experimental  error  may  originate  from  several 
sources  and  with  suitable  experimental  designs  may  be  divided  into  components 
corresponding  to  their  origin.  To  illustrate,  the  error  in  an  observation 
might  be  due  to  the  method  of  measurement,  such  as  analytical  error,  and 
also  to  the  manner  of  conducting  the  experiment  in  the  laboratory.  In  most 
cases  an  estimate  of  the  total  error  from  all  such  sources  is  obtained  from 
all  tests  as  the  estimated  experimental  error.  A critical  inspection  of  the 
sources  of  error  is  not  made  unless  this  becomes  essential  in  attempts  to 
reduce  the  size  of  the  experimental  error.  In  this  illustration  a presumed 
high  degree  of  precision  in  a research  laboratory  might  lead  one  to  suspect 
that  errors  in  chemical  analysis  were  responsible  for  a considerable  portion 
of  the  experimental  error.  A resolution  of  this  suspicion  would  be  essential 
in  deciding  how  a reduction  in  experimental  error  could  be  accomplished.  In 
this  case  each  test  would  be  performed  two  or  more  times,  and  two  or  more 
chemical  analyses  then  would  be  made  for  each  test.  This  procedure  makes 
possible  the  separation  of  the  experimental  error  into  two  components. 

Sometimes  a suitable  estimate  of  the  experimental  error  exists  from  pre- 
vious records  or  experience.  In  such  cases  this  estimate  may  be  used  in  the 
evaluation  of  the  accuracy  of  observations  or  the  relative  importance  of  exper- 
imental effects.  When  previous  estimates  of  experimental  error  are  not  suita- 
ble, the  experiment  may  be  designed  so  as  to  obtain  an  estimate  from  the 
experiment.  Sometimes  one  or  more  of  the  possible  highest  order  interactions 
are  known  to  be  nonexistent  or  statistically  unimportant  as  determined  by  the 
analysis  of  previous  data.  If  so,  the  experiment  can  be  designed  so  that  such 
possible  highest  order  interactions  are  not  confounded  with  other  effects,  and 
these  interactions  then  will  provide  an  estimate  of  the  experimental  error 
when  the  data  are  analyzed.  When  all  possible  interactions  are  known  to  exist, 
it  may  be  necessary  to  obtain  an  estimate  of  the  experimental  error  by  making 
two  or  more  replications  of  each  test.  Multiple  replication  also  is  often 
necessary  when  the  number  of  possible  interactions  that  are  nonexistent  are 
too  few  to  give  an  accurate  estimate  of  the  experimental  error.  Whenever  an 
estimate  of  experimental  error  is  obtained  from  an  experiment,  some  considera- 
tion should  be  given  to  the  possibility  of  combining  it  with  previous  esti- 
mates to  get  a more  precise  estimate. 

The  basic  purpose  of  multiple  replication  is  to  increase  the  precision  of 
estimating  effects.  If  this  purpose  is  fulfilled  in  any  experiment, 
increased  replication  for  the  purpose  of  obtaining  a better  estimate  of  the 
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experimental  error  is  an  unnecessary  waste  of  time  and  labor.  The  precision 
with  which  effects  are  estimated  depends  upon  the  relative  sizes  of  the 
effects  to  be  estimated  and  the  experimental  error  involved.  The  smaller  an 
effect  is  when  compared  to  the  experimental  error,  the  better  the  estimate  of 
the  experimental  error  must  be  to  insure  precision  in  estimating  the  effect. 

When  an  experiment  is  designed  by  statistical  methods,  the  experimental 
errors  that  are  associated  with  each  observation  are  assumed  to  be  random 
variables.  For  example,  consider  the  manganese  flotation  experiment  in  which 
the  response  is  manganese  recovery  and  the  factor  Xa  is  conditioning  time  with 
levels  k = 1,  2,  3 corresponding  to  3,  4,  and  5 minutes'  conditioning  time, 
respectively.  The  factor  Xj,  is  temperature  with  levels  m = 1,  2,  3,  4 corre- 
sponding to  30°,  40°,  50°,  and  60°  C. , respectively,  and  the  replicates  are 
r = 1,  and  2.  The  model  for  this  experiment  is 

ykmr  = M-  + *£k  + *2m  + ^km  + *er(km)‘  (230> 

All  of  the  experimental  errors  *6^^)  are  assumed  to  be  random  in  their 
occurrence. 

The  assumption  also  is  made  that  the  random  error  terms  associated  with 
the  manganese  recoveries  of  each  replicate  are  uncorrelated  and  that  the  ran- 
dom error  terms  associated  with  each  level  of  a factor  are  uncorrelated.  This 
means  that  variations  in  test  conditions  that  are  beyond  control  have  caused 
the  individual  errors  in  such  a manner  that  they  are  unrelated  in  their  occur- 
rence. Correlation  between  errors  may  result  from  the  effect  of  some  addi- 
tional factor,  which  has  been  ignored  but  which  has  influenced  the  replicates 
of  a test  or  the  results  of  tests  from  level  to  level  of  a factor  in  the  same 
way.  The  control  or  elimination  of  such  factors  is  not  always  possible.  How- 
ever, the  effect  of  such  factors  upon  the  validity  of  the  subsequent  analysis 
may  be  minimized  by  randomization  in  the  execution  of  the  experiment.  Random- 
ization ensures  that,  in  general,  such  factors  contribute  equally  to  each 
observation. 

In  designing  experiments  it  also  is  important  to  consider  what  may  be 
known  about  the  variance  of  the  error  terms.  Since  the  statistical  concept  of 
variance  has  not  yet  been  introduced,  it  now  will  be  described  with  reference 
to  observations  and  their  errors.  An  understanding  of  this  concept  is  essen- 
tial for  developing  designs  that  will  yield  experimental  data  suitable  for  the 
most  profitable  analysis.  If  several  replicates  of  a single  test  are  made  for 
the  experiment  with  the  model  (230),  the  observations,  yUmr , will  have  a mean, 
y^,  which  is  an  estimate  of  the  true  mean,  Each  observation  will  have  a 

deviation  from  the  true  mean  which  is  *er(jCm)  = ykmr  " Hkm*  If  a large  number 

of  such  replicates  were  made,  the  quantity  £ - p^^/n  as  n becomes 

r = 1 

infinite  would  be  a measure  of  the  true  variation  associated  with  the  observa- 
tions and  is  called  the  variance  of  the  observations.  This  variance  is  repre- 
sented by  the  symbol  ct  3(yyTnT.)  and  also  is  the  variance  of  the  error  terms, 

*er(km)»  When  designing  the  experiment  for  model  (230),  the  assumption  is 
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made,  unless  there  is  contrary  evidence,  that  the  variance  of  the  error  terms 
is  equal  for  all  levels  of  conditioning  time  and  for  all  levels  of  temperature 
When  the  condition  of  equality  in  variance  exists,  the  error  variance  is  said 
to  be  homogeneous. 

In  the  subsequent  analysis  of  the  data  it  is  desirable  that  the  variances 
of  the  mean  values  of  the  observations  for  the  replicates  of  each  test  should 
be  equal.  In  the  case  of  the  model  (230)  the  variance  of  the  mean  value,  y^, 
for  n replicates  of  the  same  test  is  represented  by  a 2(yfcm)»  anc*  *-ts 
relationship  to  ct  2(yicmr)  is 

° 2(ykm)  = 2(ykmr)/n*  <231> 

When  homogeneity  of  variance  in  the  observations  exists,  homogeneity  of  vari- 
ance in  the  observation  means  also  will  exist  for  the  usual  case  of  an  orthog- 
onal design  in  which  all  tests  have  the  same  number  of  replicates.  A condi- 
tion of  nonhomogeneity  in  expected  error  variance  must,  therefore,  be 
considered  when  following  the  principles  of  experimental  design  previously 
set  forth.  An  illustration  of  an  experiment  in  which  the  expected  error 
variance  is  nonhomogeneous  and  the  design  precautions  which  are  useful  for 
obtaining  homogeneous  observation  means  now  will  be  presented. 

Consider  the  research  problem  in  the  flotation  of  manganese  at  the  levels 
of  conditioning  time  and  temperature  previously  given  for  model  (230).  Sup- 
pose that  the  results  of  previous  research  showed  that  the  estimated  variance 
of  the  experimental  error  for  manganese  recovery  with  4 and  5 minutes'  condi- 
tioning time  was  approximately,  and  respectively,  two  and  three  times  the 
variance  with  3 minutes'  conditioning  time;  that  is, 

2<y3mr>  = 30  2(yimr>»  <232> 

2<yamr>  = 2a  2<yimr>-  <233> 

The  variances  of  the  mean  values  of  the  observations  at  the  m'th  level  of  tem- 
perature for  levels  K = 1,  2,  3 of  conditioning  time  according  to  equation 
(231)  are,  respectively. 


2<yim> 

“ a 2(yimr>/ni  » 

(234) 

a 2<y3m> 

= 2(y8mr>/n3‘ 

(235) 

a a<W 

* CT  2<y3mr)/n3’ 

(236) 

wherein  , and  n are  the  number  of  replicates  at  levels  i ■ 1,  2,  3, 

respectively.  For  subsequent  analysis  the  desired  result  is  that 

a 2(ylm)  - a 2(yam)  - a 2(y3m).  (237) 

From  the  relationships  of  equations  (232)  to  (237)  it  may  be  shown  that  (237) 
will  be  so  only  if  ■ 3 and  n3/ng  « 3/2.  Under  the  stipulated  condition 

that  equations  (232)  and  (233)  are  approximately  true,  the  use  of  these  ratios 
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in  replication  at  successive  levels  of  conditioning  time  will  result  in  a 
close  approximation  of  the  relationships  in  (237). 

In  general,  if  there  were  prior  evidence  of  nonhomogeneity  in  variance 
from  level  to  level  of  a factor,  it  should  be  possible  to  arrange  an  experi- 
ment so  that  homogeneity  of  variance  for  the  mean  values  of  the  test  repli- 
cates is  obtained.  Furthermore,  an  experiment  with  proportionate  replication 
from  level  to  level  of  a factor  is  orthogonal,  and  the  effects  may  be  esti- 
mated without  correlation  of  the  effect  estimates. 

There  are  some  cases  of  nonhomogeneity  in  variance  for  which  precautions 
are  not  necessary  because  it  is  possible  and  more  practical  to  use  suitable 
procedures  in  the  analysis  of  the  data  to  correct  the  situation.  The  condi- 
tions for  which  this  is  so  are  those  where  there  is  an  observed  relationship 
between  the  observation  means  and  the  error  variance  of  the  observations.  The 
nature  of  this  relationship  sometimes  may  be  obtained  from  prior  data  by  plot- 
ting the  estimated  error  variance  of  the  observations  against  the  observation 
means.  Homogeneity  of  variance  in  the  means  of  transformed  observations  then 
may  be  obtained  in  subsequent  analysis  by  substituting  a function  of  the 
response  for  the  response  itself.  In  general,  when  there  is  a known  mathemat- 
ical transformation  of  the  response  that  will  stabilize  the  variance  of  the 
transformation  means,  the  design  of  the  experiment  may  be  conducted  by  the 
principles  previously  set  forth  without  any  attempt  to  incorporate  into  the 
design  features  to  correct  nonhomogeneity  of  variance  in  the  subsequent  exper- 
imental results.  Information  concerning  the  types  of  transformations  dis- 
cussed here  may  be  found  in  the  literature.28-31 


28Bartlett,  M.  S. , The  Use  of  Transformations:  Biometrics,  vol.  3,  No.  1, 
March  1947,  pp.  39-52. 

29Kendall,  M.  G. , The  Advanced  Theory  of  Statistics:  Hafner  Publishing  Co., 
New  York,  N.Y. , vol.  2,  3d  ed. , 1951,  pp.  206-209. 

30Davies,  0.  L. , Design  and  Analysis  of  Industrial  Experiments:  Oliver  and 
Boyd,  London,  1954,  pp.  44-45. 

31  Bennett,  C.  A.,  and  Franklin,  N.  L. , Statistical  Analysis  in  Chemistry  and 
the  Chemical  Industry:  John  Wiley  & Sons,  Inc.,  New  York,  N.Y. , 1954, 
pp.  355-357. 
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